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Preface 



I intend this book to be, firstly, a introduction to calculus based on the hy- 
perreal number system. In other words, I will use infinitesimal and infinite 
numbers freely. Just as most beginning calculus books provide no logical jus- 
tification for the real number system, I will provide none for the hyperreals. 
The reader interested in questions of foundations should consult books such as 
Abraham Robinson's Non-standard Analysis or Robert Goldblatt's Lectures on 
the Hyperreals. 

Secondly, I have aimed the text primarily at readers who already have some 
familiarity with calculus. Although the book does not explicitly assume any 
prerequisites beyond basic algebra and trigonometry, in practice the pace is 
too fast for most of those without some acquaintance with the basic notions of 
calculus. 
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Chapter 1 

Derivatives 

1.1 The arrow paradox 

In his famous arrow paradox, Zeno contends that an arrow cannot move since 
at every instant of time it is at rest. There are at least two logical problems 
hidden in this claim. 

1.1.1 Zero divided by zero 

In one interpretation, Zeno seems to be saying that, since at every instant of 
time the arrow has a definite position, and hence does not travel any distance 
during that instant of time, the velocity of the arrow is 0. The question is, if an 
object travels a distance in time of duration 0, is the velocity of the object 0? 
That is, is 

5 = 0? (1.1.1) 

To answer this question, we need to examine the meaning of dividing one 
number by another. If o and b are real numbers, with b ^ 0, then 

l=c (1.1.2) 

means that 

a = bxc. (1.1.3) 

In particular, for any real number 67^0, 





(1.1.4) 



since 6x0 = 0. Note that if a ^ 0, then 

a 



(1-1.5) 
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is undefined since there does not exist a real number c for which x c is equal 
to a. We say that division of a non-zero number by zero is meaningless. On the 
other hand, 

I (1-1.6) 

is undefined because x c = for all real numbers c. For this reason, we say 
that division of zero by zero is indeterminate. 

The first logical problem exposed by Zeno's arrow paradox is the problem 
of giving determinate meaning to ratios of quantities with zero magnitude. We 
shall see that infinitesimals give us one way of giving definite meanings to ratios 
of quantities with zero magnitudes, and these ratios will provide the basis for 
what we call the differential calculus. 



1.1.2 Adding up zeroes 

Another possible interpretation of the arrow paradox is that if at every instant 
of time the arrow moves no distance, then the total distance traveled by the 
arrow is equal to added to itself a large, or even infinite, number of times. 
Now if n is any positive integer, then, of course, 

nxO = 0. (1-1.7) 

That is, zero added to itself a finite number of times is zero. However, if an 
interval of time is composed of an infinite number of instants, then we are asking 
for the product of infinity and zero, that is, 

oo x 0. (1.1.8) 

One might at first think this result should also be zero; however, more careful 
reasoning is needed. 

Note that an interval of time, say the interval [0, 1], is composed of an infinity 
of instants of no duration. Hence, in this case, the product of infinity and 
must be 1, the length of the interval. However, the same reasoning applied to 
the interval [0,2] would lead us to think that infinity times is 2. Indeed, as 
with the problem of zero divided by 0, infinity times is indeterminate. 

Thus the second logical problem exposed by Zeno's arrow paradox is the 
problem of giving determinate meaning to infinite sums of zero magnitudes, or, 
in the simplest cases, to products of infinitesimal and infinite numbers. 

Since division is the inverse operation of multiplication we should expect a 
close connection between these questions. This is in fact the case, as we shall 
see when we discuss the fundamental theorem of calculus. 



1.2 Rates of change 

Suppose x(t) gives the position, at some time t, of an object (such as Zeno's 
arrow) moving along a straight line. The problem we face is that of giving a 
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determinate meaning to the idea of the velocity of the object at a specific instant 
of time. We first note that we face no logical difficulties in defining an average 
velocity over an interval of time of non-zero length. That is, if a < b, then the 
object travels a distance 

Ax = x(b)-x(a) (1-2.1) 

from time t = a to time t = b, an interval of time of length At = b — a, and, 
consequently, the average velocity of the object over this interval of time is 

x(b) — x(a) Ax 

Example 1.2.1. Suppose an object, such as a lead ball, is dropped from a 
height of 100 meters. Ignoring air resistance, the height of the ball above the 
earth after t seconds is given by 

x(t) = 100 - AM 2 meters, 

a result first discovered by Galileo. Hence, for example, from time t = to time 
t = 2 we have 

Ax = x{2) - x{0) = (100 - (4.9)(4)) - 100 = -19.6 meters, 

At = 2 - = 2 seconds, 

and so 

19.6 
^[0,2] = = — 9.8 meters/second. 

For another example, from time t = 1 to time t = 4 we have 
Ax = x(4) - x{\) = 21.6 - 95.1 = -73.5, 

At = 4—1 = 3 seconds, 

and so 

73 5 

u [i.4l = = — 24.5 meters/second. 

o 

Note that both of these average velocities are negative because we have taken 
the positive direction to be upward from the surface of the earth. 

Exercise 1.2.1. Suppose a lead ball is dropped into a well. Ignoring air resis- 
tance, the ball will have fallen a distance x{t) = 16i 2 feet after t seconds. Find 
the average velocity of the ball over the intervals (a) [0,2], (b) [1,3], and (c) 
[1,1.5]. 

Letting At = b — a, we may rewrite (1.2.2) in the form 

x{a + At) -x{a) 

V[a, a +At] = 2U ' (1-2.3) 
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Using (1.2.3), there are two approaches to generalizing the notion of average 
velocity over an interval to that of velocity at an instant. The most common 
approach, at least since the middle of the 19th century, is to consider the effect 
on U[a,a+At] as At diminishes in magnitude and defining the velocity at time 
t = a to be the limiting value of these average velocities. The approach we 
will take in this text is to consider what happens when we take a and b to be, 
although not equal, immeasurably close to one another. 

Example 1.2.2. If we have, as in the previous example, 

x(t) = 100 - 4.9t 2 meters, 

then from time t = 1 to time t = 1 + At we would have 

Aa; = x(l + At) - x(l) 

= (100-4.9(1 + At) 2 )- 95.1 
= 4.9- 4.9(1 + 2At+ {At) 2 ) 
= -9.8At - 4.9(A£) 2 meters. 

Hence the average velocity over the interval [1,1 + At] is 

Aa; 

«[1,1+At] = -£ 

_ -9.8At-4.9{At) 2 

~ At 

= —9.8 — A.9At meters/second. 

Note that if, for example, At = 3, then we find 

V[1A] = -9.8 - (4.9)(3) = -9.8 - 14.7 = -24.5 meters/second, 

in agreement with our previous calculations. 

Now suppose that the starting time a = 1 and the ending time b are different, 
but the difference is so small that it cannot be measured by any real number. 
In this case, we call dt = b — a an infinitesimal . Similar to our computations 
above, we have 

dx = x(l + dt) - x(l) = -9.8dt - A.9(dt) 2 meters, 

the distance traveled by the object from time t = 1 to time t = 1 + dt, and 

dx 
V[i.i+dt] = -~r = —9.8 — 4.9<H meters/second, 

the average velocity of the object over the interval [1, 1 + dt]. However, since dt 
is infinitesimal, so is A.9dt. Hence Um i+dt] is immeasurably close to —9.8 meters 
per second. Moreover, this is true no matter what the particular value of dt. 
Hence we should take the instantaneous velocity of the object at time t = 1 to 
be 

v(l) = —9.8 meters/second. 
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Exercise 1.2.2. As in the previous exercise, suppose a lead ball has fallen 
x(t) = 16t 2 feet in t seconds. Find the average velocity of the ball over the 
interval [1,1 + At] and use this result to obtain the answers to parts (b) and (c) 
of the previous exercise. 

Exercise 1.2.3. Find the average velocity of the ball in the previous exercise 
over the interval [1,1 + dt] , where dt is infinitesimal, and use the result to find 
the instantaneous velocity of the ball at time t = 1 . 

Example 1.2.3. To find the velocity of the object of the previous examples at 
time t = 3, we compute 

dx = x(S + dt) — x(3) 

= (100 - 4.9(3 + dt) 2 - 55.9 
= 44.1- 4.9(9 + 6dt+ {dt 2 }) 
= -29 Adt - A.9(dt) 2 meters, 

from which we obtain 

dx 

— = —29.4 — 4.9dt meters/second. 

As above, we disregard the immeasurable — 4.9dt to obtain the velocity of the 
object at time t = 3: 

v(3) = — 29.4meters/second. 

Exercise 1.2.4. Find the velocity of the ball in the previous exercise at time 
t= 2. 

In general, if x(t) gives the position, at time t, of an object moving along a 
straight line, then we define the velocity of the object at a time t to be the real 
number which is infinitesimally close to 

x(t + dt)-x(t) 

dt ' ( ' 

provided there is exactly one such number for any value of the nonzero infinites- 
imal dt. 

Example 1.2.4. For our previous example, we find 

dx = x(t + dt) — x(t) 

= (100 - 4.9(t + dt) 2 ) - (100 - AM 2 ) 
= -4.9(t + 2tdt + (dt) 2 ) - 4.9t 2 
= -9Mdt - A.9(dt) 2 meters 
= (-9.8t - A.9dt)dt. 
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Hence 

dx 

— = — 9.8£ — 4.9dt meters/second, 

and so the velocity of the object at time t is 

v{t) = — 9.8i meters/second. 



In particular, 
and 



v(l) = —9.8 meters/second 



v(3) = -9.8(3) = -29.4 meters/second, 
as previously computed. 

Exercise 1.2.5. Find the velocity of the ball in the previous exercise at time 
t. Use your result to verify your previous answers for v(l) and v(2). 

Even more generally we should recognize that velocity is but a particular 
example of a rate of change, namely, the rate of change of the position of an 
object with respect to time. In general, given any quantity y as a function of 
another quantity x, say y = f(x) for some function /, we may ask about the 
rate of change of y with respect to i. If a; changes from x = a to x = b and we 
let 

Ax = b-a (1.2.5) 

and 

Ay = f(b) - f(a) = f(a + Ax) - f(x), (1.2.6) 

then 

Ax b — a 

is the average rate of change of y with respect to x; if dx is a nonzero infinites- 
imal, then the real number which is infinitesimally close to 

dy = f{x + dx) - f{x) 
dx dx 

is the instantaneous rate of change, or, simply, rate of change, of y with respect 
to x at x = a. In subsequent sections we will look at this quantity in more 
detail, but will consider one more example before delving into technicalities. 

Example 1.2.5. Suppose a spherical shaped balloon is being filled with water. 
If r is the radius of the balloon in centimeters and V is the volume of the balloon, 
then 

4 o q 

V = —irr centimeters . 
3 

Since a cubic centimeter of water has a mass of 1 gram, the mass of the water 

in the balloon is 

4 , 
M = —irr grams. 
3 S 
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To find the rate of change of the mass of the balloon with respect to the radius 
of the balloon, we first compute 

4 o 4 , 

dM = -ir(r + dry irr 6 

= -7r((r 3 + 3r 2 dr + 3r(dr) 2 + (dr) 3 ) - r 3 ) 



-7r(3r + 3rdr + (dr) )dr grams, 



from which it follows that 

= -7r(3r + 3rdr + (dr) ) grams/centimeter. 

dr 3 

Since both 3rdr and (dr) 2 are infinitesimal, the rate of change of mass of the 
balloon with respect to the radius of the balloon is 

-7r(3r ) = 4irr gams/centimeer. 
o 

For example, when the balloon has a radius of 10 centimeters, the mass of the 
water in the balloon is increasing at a rate of 

47r(10) = 4007T grams/centimeter. 

It may not be surprising that this is also the surface area of the balloon at that 
instant. 

Exercise 1.2.6. Show that if A is the area of a circle with radius r, then 
W = 2*r. 



1.3 The hyperreals 

We will let R denote the set of all real numbers. Intuitively, and historically, we 
think of these as the numbers sufficient to measure geometric quantities. For 
example, the set of all rational numbers, that is, numbers expressible as the 
ratios of integers, is not sufficient for this purpose since, for example, the length 
of the diagonal of a square with sides of length 1 is the irrational number V2. 
There are numerous technical methods for defining and constructing the real 
numbers, but, for the purposes of this text, it is sufficient to think of them as 
the set of all numbers expressible as infinite decimals, repeating if the number 
is rational and non-repeating otherwise. 

A positive infinitesimal is any number e with the property that e > and 
e < r for any positive real number r. The set of infinitesimals consists of the 
positive infinitesimals along with their additive inverses and zero. Intuitively, 
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these are the numbers which, except for 0, correspond to quantities which are 
too small to measure even theoretically. Again, there are technical ways to make 
the definition and constrution of infinitesimals explicit, but they lie beyond the 
scope of this text. 

The multiplicative inverse of a nonzero infinitesimal is an infinite number. 
That is, for any infinitesimal e^ 0, the number 

N='- 
e 

is an infinite number. 

The finite hyperreal numbers are numbers of the form r + e, where r is a real 
number and e is an infinitesimal. The hyperreal numbers, which we denote *K, 
consist of the finite hyperreal numbers along with all infinite numbers. 

For any finite hyperreal number a, there exists a unique real number r for 
which a = r + e for some infinitesimal e. In this case, we call r the shadow of a 
and write 

r = sh(a). (1.3.1) 

Alternatively, we may call sh(a) the standard part of a. 

We will write a ~ b to indicate that a — b is an infinitesimal, that is, that a 
and b are infinitesimally close. In particular, for any finite hyperreal number a, 
a ~ sh(a). 

It is important to note that 

• if e and 5 are infinitesimals, then so is e + 5, 

• if e is an infinitesimal and a is a finite hyperreal number, then at is an 
infinitesimal, and 

• if e is a nonzero infinitesimal and a is a hyperreal number with sh(o) 7^ 
(that is, a is not an infinitesimal), then - is infinite. 

These are in agreement with our intuition that a finite sum of infinitely small 
numbers is still infinitely small and that an infinitely small nonzero number will 
divide into any noninfinitesimal quantity an infinite number of times. 

Exercise 1.3.1. Show that sh(a + 6) =sh(a) + sh(6) and sh(ab) =sh(a)sh(6), 
where a and b are any hyperreal numbers. 

Exercise 1.3.2. Suppose a is a hyperreal number with sh(a) 7^ 0. Show that 

\aJ sh(a) 

1.4 Continuous functions 

As (1.2.8) indicates, we would like to define the rate of change of a function 
y = f(x) with respect to x as the shadow of the ratio of two quantities, dy = 
f(x + dx) — f{x) and dx, with the latter being a nonzero infinitesimal. From 



1.4. CONTINUOUS FUNCTIONS 9 

the discussion of the previous section, it follows that we can do this if and only 
if the numerator dy is also an infinitesimal. 

Definition 1.4.1. We say a function / is continuous at a real number c if for 
every infinitesimal e, 

/( c +e)~/(c) (1.4.1) 

Note that f(c + e) ~ /(c) is equivalent to /(c + e) — /(c) ~ 0, that is, 
f(c + e) — /(c) is an infinitesimal. In other words, a function / is continuous 
at a real number c if an infinitesimal change in the value of c results in an 
infinitesimal change in the value of /. 

Example 1.4.1. If f(x) = x 2 , then, for example, for any infinitesimal e, 

/(3 + e) = (3 + e) 2 = 9 + 6e + e 2 ~ 9 = /(3). 
Hence / is continuous at x = 3. More generally, for any real number x, 

f(x + e) = (x + e) 2 = x 2 + 2xe + e 2 ~ x 2 = f(x), 
from which it follows that / is continuous at every real number x. 

Exercise 1.4.1. Verify that f(x) = 3x + 4 is continuous at x = 5. 

Exercise 1.4.2. Verify that g(t) = t 3 is continuous at t = 2. 

Given real numbers a and b, we let 

(a, b) = {x | x is a real number and a < x < b}, (1.4.2) 

(a, oo ) = {x | x is a real number and x > a}, (1.4.3) 

(—00, b) = {x | x is a real number and a: < 6}, (1-4-4) 

and 

(-00,00) =R. (1.4.5) 

An open interval is any set of one of these forms. 

Definition 1.4.2. We say a function / is continuous on an open interval / if 
/ is continuous at every real number in /. 

Example 1.4.2. From our example above, it follows that f(x) = x 2 is contin- 
uous on (—00, 00) . 

Exercise 1.4.3. Verify that f(x) = 3x + 4 is continuous on (—00,00). 
Exercise 1.4.4. Verify that g(t) = t 3 is continuous on (—00,00). 
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Figure 1.4.1: Graph of the Heaviside function 



Example 1.4.3. We call the function 

H(t) 



0, if * < 0, 

1, if * > 0, 



the Heaviside function (see Figure 1.4.1). If e is a positive infinitesimal, then 

H(0 + e)=H{e) = l = H{0), 

whereas 

if(O-e) =i?(-e) = 0. 

Since is not infinitesimally close to 1, it follows that H is not continuous at 
0. However, for any positive real number a and any infinitesimal e (positive or 
negative), 

H{a + e) = 1 = H{a), 

since a + e > 0, and for any negative real number a and any infinitesimal e, 

H(a + e) = = H{a), 
since a + e < 0. Thus J? is continuous on both (0, oo) and (—00, 0). 

Note that, in the previous example, the Heaviside function satisfies the con- 
dition for continuity at for positive infinitesimals but not for negative infinites- 
imals. The following definition addresses this situation. 

Definition 1.4.3. We say a function / is continuous from the right at a real 
number c if for every infinitesimal e > 0, 



/( c + e)~/(c). 



(1.4.6) 
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Similarly, we say a function / is continuous from the left at a real number c if 
for every infinitesimal e > 0, 

/(c-e)~/(c). (1.4.7) 

Example 1.4.4. In the previous example, H is continuous from the right at 
t = 0, but not from the left. 

Of course, if / is continuous both from the left and the right at c, then / is 
continuous at c. 

Example 1.4.5. Suppose 

)32: + 5, if x < 1, 
K ' \ 10 -2x, if x>l. 

If e is a positive infinitesimal, then 

/(l + e ) = 3(1 + e) + 5 = 8 + 3e ~ 8 = /(l), 
so / is continuous from the right at x = 1, and 

/(l - c) = 3(1 - e) +5 = 8 - 3e ~ 8 = /(l), 
so / is continuous from the left at x = 1 as well. Hence / is continuous at x = 1. 



Exercise 1.4.5. Verify that the function 

U(t)- 



0, if t < 0, 

1, if < t < 1, 
0, if * > 1, 



is continuous from the right at t = and continuous from the left at t = 1, but 
not continuous at either t = or t = 1. See Figure 1.4.2. 

Given real numbers a and b, we let 

[a, 6] = {x I 2; is a real number and a < x < 6}, (1.4.8) 

[0, 00) = {2; I x is a real number and x > a}, (1.4.9) 

and 

(— 00, 6] = {2 I a; is a real number and x < &}. (1.4.10) 

A closed interval is any set of one of these forms. 

Definition 1.4.4. If a and 6 are real numbers, we say a function / is contin- 
uous on the closed interval [a, b] if / is continuous on the open interval (a, b), 
continuous from the right at a, and continuous from the left at b. We say / is 
continuous on the closed interval [a, 00) if / is continuous on the open interval 
(a, 00) and continuous from the right at a. We say / is continuous on the closed 
interval (—00, 6] if / is continuous on (—00, b) and continuous from the left at b. 
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Figure 1.4.2: Graph of y = U(t) from Exercise 1.4.5 

Example 1.4.6. We may summarize our results about the Heaviside function 
as H is continuous on (— oo,0) and on [0, oo). 

Exercise 1.4.6. Explain why the function U in the previous exercise is con- 
tinuous on the intervals (— oo,0), [0,1], and (l,oo), but not on the interval 

(—00,00). 



1.5 Properties of continuous functions 

Suppose / is continuous at the real number c and k is any fixed real number. If 
we let h(x) = kg(x), then, for any infinitesimal e, 



h(c + e) - h(c) = kf(c + e) - kf(c) = k(f(c + e) - /(c)) 



(1.5.1) 



is an infinitesimal since, by assumption, /(c+e) — /(c) is an infinitesimal. Hence 
h(c+e) ~ h(c). 

Theorem 1.5.1. If / is continuous at c and k is any fixed real number, then 
the function h(x) = kf(x) is also continuous at c. 

Example 1.5.1. We have seen that /(x) = x 2 is continuous on (—00,00). It 
now follows that, for example, g{x) = 5x 2 is also continuous on (—00,00). 

Suppose that both / and g are continuous at the real number c and we let 
s(x) = f(x) + g(x). If e is any infinitesimal, then 



s(c + e) = /(c + e) + g(c + e) ~ /(c) + g(c) = s{c), 
and so s is also continuous at c. 



(1.5.2) 
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Theorem 1.5.2. If / and g are both continuous at c, then the function 

s{x) = f(x)+g(x) 

is also continuous at c. 
Example 1.5.2. Since 

(x + e) 3 = x 3 + 3x 2 e + 3xe 2 + e 3 ~ x 3 

for any real number x and any infinitesimal e, it follows that g{x) = x 3 is 
continuous on (—00,00). From the previous theorems, it then follows that 

h(x) = 5x 2 + 3x 3 

is continuous on (—00,00). 

Again, suppose / and g are both continuous at c and let p(x) = f(x)g(x). 
Then, for any infinitesimal e, 

P {c + e)-s(c) = /(c + e)g(c + e) - f{c)g{c) 

= f(c + e)g(c + e) - f(c)g(c + e) + f(c)g(c + e) - f(c)g(c) 

= g ( c +e)(f(c+e) - /(c)) + f(c)(g(c+e) - g{c)), (1.5.3) 

which is infinitesimal since both /(c + e) — /(c) and g(c + e) — g(c) are. Hence 
p is continuous at c. 

Theorem 1.5.3. If / and g are both continuous at c, then the function 

p(x) = f{x)g{x) 
is also continuous at c. 

Finally, suppose / and g are continuous at c and g(c) 7^ 0. Let g(a;) = ^y. 
Then, for any infinitesimal e, 

g(c + e) - g(c) = — -— 

9{c + e) g{c) 

_ f( c +e)g(c)-f(c)g(c+e) 

g{c+e)g(c) 

f(c + e)g(c) - f(c)g(c) + f(c)g(c) - f(c)g(c + e) 



g(c+e)g(c) 
g(c)(f(c + e) - /(c)) - /(c)( g (c + e) - g(c)) 
ff(c+e)ff(c) 



(1.5.4) 



which is infinitesimal since both /(c + e) — /(c) and g(c + e) — g(c) are in- 
finitesimals, and g(c)g(c + e) is not an infinitesimal. Hence q is continuous at 
c. 
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Theorem 1.5.4. If / and g are both continuous at c and g(c) ^ 0, then the 
function 

< ^ f ^ 
is continuous at c. 
Exercise 1.5.1. Explain why 



x 2 + l 
is continuous on (—00,00). 

1.5.1 Polynomials and rational functions 

It is now possible to identify two important classes of continuous functions. 
First, every constant function is continuous: indeed, if f(x) = k for all real 
values x, and k is any real constant, then for any infinitesimal e, 

f(x + e) = k = f(x). (1.5.5) 

Next, the function f(x) = x is continuous for all real x since, for any infinitesimal 

f(x + e) = x + e~x = f(x). (1.5.6) 

Since the product of continuous functions is continuous, it now follows that, for 
any nonnegative integer n, g(x) = x n is continuous on (—00,00) since it is a 
constant function if n = and a product of f(x) = x by itself n times otherwise. 

From this it follows (since constant multiples of continuous functions are 
again continuous) that all monomials, that is, functions of the form f(x) = ax n , 
where a is a fixed real constant and n is a nonnegative integer, are continuous. 

Now a polynomial is a function of the form 

P(x) = ao + <X\X + a 2 x 2 + ■ ■ ■ + a n x n , (1.5.7) 

where do, &i, • • • , a n are real constants and n is a nonnegative integer. That 
is, a polynomial is a sum of monomials. Since sums of continuous functions are 
continuous, we now have the following fundamental result. 

Theorem 1.5.5. If P is a polynomial, then P is continuous on (00, 00). 

Example 1.5.3. The function 

f(x) = 32 + 14a; 5 - 6a; 7 + ttx 14 

is continuous on (—00,00). 
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A rational function is a ratio of polynomials. That is, if P(x) and Q(x) are 
polynomials, then 

P(r) 
R(x) = -=)-( (1.5.8) 

is a rational function. Since ratios of continuous functions are continuous, we 
have the following. 

Theorem 1.5.6. If R is rational function, then R is continuous at every point 
in its domain. 

Example 1.5.4. If 

ft \ 3a: ~ 4 
x l — 1 

then / is a rational function defined for all real x except x = — 1 and x = 1. 
Thus / is continuous on the intervals (— oo, — 1), (—1,1), and (1, oo). 

Exercise 1.5.2. Find the intervals on which 

ft \ *x 2 -l 

is continuous. 

1.5.2 Trigonometric functions 

Recall that if t is a real number and (a, b) is the point in the plane found by 
traversing the unit circle x 1 + y 2 = 1 a distance \t\ from (1,0), in the counter- 
clockwise direction if t > and in the clockwise direction otherwise, then 

a = cos(i) (1.5.9) 

6 = sin(t). (1.5.10) 

Note that for < t < ir, as in Figure 1.5.1, £ is greater than the length 
of the line segment from A = (1,0) to B = (cos (t), sin (£)). Now the segment 
from ^4 to _B is the hypotenuse of the right triangle with vertices at A, B, and 
C = (cos(£),0). Since the distance from C to A is 1 — cos(f)) and the distance 
from B to C is sin(£), it follows from the Pythagorean theorem that 

t 2 > (l-cos(£)) 2 +sin 2 (£) 

= 1-2 cos(t) + cos 2 (t) + sin 2 (£) 

= 2-2cos(t). (1.5.11) 

A similar diagram reveals the same result for — ir < t < 0. Moreover, both t 2 
and 2 — 2 cos(t) are when t = 0, so we have t 2 > 2 — 2 cos(£) for all — n < t < it. 
Additionally, < 2 - 2cos(£) < 4 for all t (since -1 < cos(i) < 1 for all t), 
so certainly t 2 > 2 — 2cos(£) whenever |i| > 2. Hence we have shown that 

<2-2cos{t) <t 2 (1.5.12) 



16 
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-1 -0.75 -0.5 -0.25 0.25 0.5 



X 2 + y 2 = 1 



B = (cos(i),sin(£)) 




0.75 1 



Figure 1.5.1: An arc of length t on the unit circle 



for all values of t. Equivalently, 



< 1 - cos ft) < -t 2 

w - 2 

Solving for cos(t), we may also write this as 



(1.5.13) 



1 < cosft) < 1 

for all t. 

In particular, if e is an infinitesimal, then 

1 < cos(e) < 1 

2 - w - 

implies that 

cos(0 + e) = cos(e) ~ 1 = cos(0). 

That is, the function f(t)= cos(t) is continuous at t = 0. 
Moreover, since < 1 + cos(i) < 2 for all t, 

sin 2 (i) = 1 -cos 2 (t) 

= (l-cos(t))(l + cos(t)) 

<y(l + cos(t)) 



(1.5.14) 



<* 2 , 



from which it follows that 



(1.5.15) 
(1.5.16) 



|8in(t)|<|*| 



(1.5.17) 
(1.5.18) 
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for any value of t. In particular, for any infinitesimal e, 

|sin(0 + e)| = |sin(e)| < e, (1.5.19) 

from which it follows that 

sin(e) ~ = sin(O). (1.5.20) 

That is, the function g(t) = sin(i) is continuous at t = 0. 

Using the angle addition formulas for sine and cosine, we see that, for any 
real number t and infinitesimal e, 

cos(i + e) = cos(t) cos(e) — sin(i) sin(e) ~ cos(t) (1.5.21) 

and 

sin(£ + e) = sin(t) cos(e) + cos(t) sin(e) ~ sin(t), (1.5.22) 

since cos(e) ~ 1 and sin(e) is an infinitesimal. Hence we have the following 
result. 

Theorem 1.5.7. The functions fit) = cos(i) and g(t) = sin(i) are continuous 
on (-co, oo). 

The following theorem now follows from our earlier results about continuous 
functions. 

Theorem 1.5.8. The following functions are continuous at each point in their 
respective domains: 

tan(t) = ^W (1.5.23) 

cos(i) 

. . cos(i) 
cot(t) = -^, (1.5.24) 

sec(t) = ^, (1.5.25) 

csc(^) = -*—. (1.5.26) 

sm(i) 

With a little more geometry, we may improve upon the inequalities in 
(1.5.13) and (1.5.18). Consider an angle < t < f , let A = (1,0) and 
B = (cos(i), sin(i)) as above, and let D be the point of intersection of the 
lines tangent to the circle x 2 + y 2 = 1 at A and B (see Figure 1.5.2). Note 
that the triangle with vertices at A, B, and D is isosceles with base of length 
-\/2(l — cos(i)) (as derived above) and base angles |. Moreover, the sum of the 
lengths of the two legs exceeds t. Since each leg is of length 



|ygl -cosffl) 

cos (|) 



(1.5.27) 
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D = (cos(t),sin(i)) 




0.25 0.5 0.75 1 



Figure 1.5.2: An upper bound for the arc length t 



it follows that 



t < 



y/2(l - COB(f)) 
COS (|) 



(1.5.28) 



Moreover, both sides of this inequality are when t = and we could derive 
the same inequality for -| < t < 0, so we have 



I < 



v/2(l - cos(t)) 



58(1) 



for all — ^ < t < -| . It now follows that 



1 2 1 - cos(t) _ 2(1 - cos(i)) 



cos- 



(§) l + cos(t) 



for all — § < £ < § , where we have used the half-angle identity 



(1.5.29) 



(1.5.30) 



1 + cos(t) 



2) 2 

Combining (1.5.30) with (1.5.13), we have 

, ,. w 1,2^ 2(l-cos(t)) 

1 — cos(i) < -t < r^- 

w ~ 2 ~ 1 + cos(i) 

from which we obtain, for all — ? < £ < 5 , 

1 + cos(t) 1 — cos(i) 
2 ^^ 



(1.5.31) 



(1.5.32) 



(1.5.33) 
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Figure 1.5.3: Comparing y = 1 — cos(t) with y = ^t 2 

Now if e is an infinitesimal, then 1 + cos(e) ~ 2, and so 

1 + cos(e) 
2 ~ 

Hence, substituting t = e in (1.5.33), we have 

1 — cos(e) 

— ~ 1 

2 fc 

Moreover, we then have 

sin 2 (e) 1 — cos 2 (e) 1 — cos(e) 



1 



e* e* a* 2 

Since e and sin(e) have the same sign, it follows that 

shi(e) _ ^ 



(1.5.34) 



(l + cos(i))~ -(2) = 1. (1.5.35) 



(1.5.36) 



For real numbers t, (1.5.34) and (1.5.36) say that, for small values of t, 



and 



cos(£) w 1 



sin(i) w i. 



(1.5.37) 



(1.5.38) 



Figures 1.5.3 and 1.5.4 graphically display the comparisons in (1.5.37) and 
(1.5.38). 
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Figure 1.5.4: Comparison of y = sin(£) with y = t 

Example 1.5.5. For a numerical comparison, note that for t = 0.1, cos(t) = 
0.9950042, compared to 1 - £ = 0.995, and sin(i) = 0.0998334, compared to 
£ = 0.1. 

Exercise 1.5.3. Verify that the triangle with vertices at A, B, and D in Figure 
1.5.2 is an isosceles triangle with base angles of | at A and B. 

Exercise 1.5.4. Verify the half-angle formula, 

cos(0) = -(l + cos(20)), 

for any angle 0, using the identities cos(20) = cos 2 (0) — sin (6) (a consequence 
of the addition formula) and sin 2 (0) + cos 2 (0) = 1. 



1.5.3 Compositions 

Given functions / and g, we call the function 

f °g{x) = f{g(x)) 



(1.5.39) 



the composition of / with g. If g is continuous at a real number c, / is continuous 
at <?(c), and e is an infinitesimal, then 



fog(c+e) = f{g(c+e))~f(g(c)) 



(1.5.40) 



since g{c + e) ~ g{c). 
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Theorem 1.5.9. If g is continuous at c and / is continuous at g(c), then fog 
is continuous at c. 

Example 1.5.6. Since f(t) = sin(i) is continuous for all t and 

3t 2 + 1 



At -8 
is continuous at all real numbers except t = 2, it follows that 

, m • z ^ + r 

hit) = sin 

w V to - 8 

is continuous on the intervals (— oo,2) and (2, oo). 

Note that if f(x) = \fx and e is an infinitesimal, then, for any x ^ 0, 
f(x + e) - /(x) = Vx + e - \/x 






x + e — x 

\Jx + e + y/x 
e 

\Jx + e + y/x' 

which is infinitesimal. Hence / is continuous on (0,oo). Moreover, if e is a 
positive infinitesimal, then yft must be an infinitesimal (since if a = y/t is not 
an infinitesimal, then a 2 = e is not an infinitesimal) . Hence 

/(0 + e) = ^-0=/(0). 

Thus / is continuous at 0, and so f(x) = \fx is continuous on [0, oo). 

Theorem 1.5.10. The function f(x) = \fx is continuous on [0,oo). 

Example 1.5.7. It now follows that fix) = \/4x — 2 is continuous everywhere 
it is defined, namely, on [2, oo). 

Exercise 1.5.5. Find the interval or intervals on which fix) = sin(-) is 
continuous. 

Exercise 1.5.6. Find the interval or intervals on which 

9(t) 



1 + t 2 



1-t 
is continuous. 
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1.5.4 Consequences of continuity 

Continuous functions have two important properties that will play key roles 
in our discussions in the rest of the text: the extreme- value property and the 
intermediate- value property. Both of these properties rely on technical aspects 
of the real numbers which lie beyond the scope of this text, and so we will not 
attempt justifications. 

The extreme-value property states that a continuous function on a closed 
interval [a, b] attains both a maximum and minimum value. 

Theorem 1.5.11. If / is continuous on a closed interval [a, b], then there exists 
a real number c in [a, b] for which /(c) < f(x) for all x in [a, b] and a real number 
d in [a, b] for which /(d) > fix) for all x in [a, b\. 

The following examples show the necessity of the two conditions of the the- 
orem (that is, the function must be continuous and the interval must be closed 
in order to ensure the conclusion). 

Example 1.5.8. The function f(x) = x 2 attains neither a maximum nor a 
minimum value on the interval (0, 1). Indeed, given any point a in (0, 1), f(x) > 
f(a) whenever a < x < 1 and f(x) < /(a) whenever < x < a. Of course, this 
does not contradict the theorem because (0, 1) is not a closed interval. On the 
closed interval [0, 1], we have /(l) > f(x) for all x in [0, 1] and /(0) < f(x) for 
all X in [0, 1], in agreement with the theorem. 

In this example the extreme values of / occurred at the endpoints of the 
interval [—1, 1]. This need not be the case. For example, if g(t) = sin(t), then, 
on the interval [0, 2ir], g has a minimum value of —1 at t = 4^ and a maximum 
value of 1 at t = ^ . 

Example 1.5.9. Let 

( 1 

, . ,, 1 < x < or < x < 1, 

0. 

See Figure 1.5.5. Then / does not have a maximum value: if a < 0, then 
f(x) > /(a) for any x > 0, and if a > 0, then f(x) > f(a) whenever < x < a. 
Similarly, / has no minimum value: if a > 0, then f(x) < /(a) for any x < 0, 
and if a < 0, then f(x) < f(a) whenever a < x < 0. The problem this time is 
that / is not continuous at x = 0. Indeed, if e is an infinitesimal, then /(e) is 
infinite, and, hence, not infinitesimally close to /(0) = 0. 

Exercise 1.5.7. Find an example of a continuous function which has both a 
minimum value and a maximum value on the open interval (0, 1). 

Exercise 1.5.8. Find an example of function which has a minimum value and 
a maximum value on the interval [0, 1], but is not continuous on [0, 1]. 
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Figure 1.5.5: A function with no minimum or maximum value on [—1,1] 



The intermediate-value property states that a continuous function attains 
all values between any two given values of the function. 

Theorem 1.5.12. If / is continuous on the interval [a, b] and m is any value 
betwen /(a) and fib), then there exists a real number c in [a, 6] for which 
/(c) = m. 

The next example shows that a function which is not continuous need not 
satisfy the intermediate- value property. 

Example 1.5.10. If H is the Heaviside function, then H{— 1) = and H{\) = 
1, but there does not exist any real number c in [—1,1] for which H(c) = ^, 
even though < \ < 1. 



1.6 The derivative 



We now return to the problem of rates of change. Given y = f{x), for any 
infinitesimal dx we let 

dy = f(x + dx)-f(x). (1.6.1) 

If y is a continuous function of x, then dy is infinitesimal and, if dx 7^ 0, the 

ratio 

dy = f{x + dx) - f(x) 

dx dx 

is a hyperreal number. If -# is finite, then its shadow, if it is the same for all 
values of dx, is the rate of change of y with respect to x, which we will call the 
derivative of y with respect to x. 
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Definition 1.6.1. Given y = f(x), suppose 

dy f(x + dx) - f(x) 



(1.6.3) 



dx dx 

is finite and has the same shadow for all nonzero infinitesimals dx. Then we call 

*(£) <'■"> 

the derivative of y with respect to x. 

Note that the quotient in (1.6.3) will be infinite if f(x + dx) — }'{x) is not 
an infinitesimal. Hence a function which is not continuous at x cannot have a 
derivative at x. 

There are numerous ways to denote the derivative of a function y = f(x). 
One is to use -^- to denote, depending on the context, both the ratio of the 
infinitesimals dy and dx and the shadow of this ratio, which is the derivative. 
Another is to write f'(x) for the derivative of the function /. We will use both 
of these notations extensively. 

Example 1.6.1. If y = x 2 , then, for any nonzero infinitesimal dx, 

dy = (x + dx) 2 - x 2 = (x 2 + 2xdx + {dx) 2 ) - x 2 = {2x + dx)dx. 

Hence 

— = 2x + dx ~ 2x, 
dx 

and so the derivative of y with respect to x is 

— = 2x. 
dx 

Example 1.6.2. If f(x) = Ax, then, for any nonzero infinitesimal dx, 

f(x + dx) - f(x) _ 4(x + dx) - Ax _ Adx _ f 
dx dx dx 

Hence f'(x) = 4. Note that this implies that f(x) has a constant rate of change: 
every change of one unit in x results in a change of 4 units in f(x). 

Exercise 1.6.1. Find ^ if y = 5x - 2. 
Exercise 1.6.2. Find g| if y = x 3 . 
Exercise 1.6.3. Find f'{x) if f(x) = 4x 2 . 

To denote the rate of change of y with respect to I at a particular value of 
x, say, when x = a, we write 

dy_ 

dx 
If y = f(x), then, of course, this is the same as writing /'(a). 



(1.6.5) 
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Figure 1.6.1: y = x$ is continuous, but not differentiable, at x = 



Example 1.6.3. If y = x 2 , then we saw above that ^ = 2x. Hence the rate 
of change of y with respect to x when x = 3 is 

= (2)(3) = 6. 



<iy_ 

dx 



a:— 3 



Example 1.6.4. If f(x) = x5 , then for any infinitesimal dx, 
f(0 + dx)-f(0) = f(dx) = (dx)t, 

which is infinitesimal. Hence / is continuous at x = 0. Now if dx ^ 0, then 
f(0 + dx)-f(0) (cte)f _ 1 



dx 



dx (dx)3 



Since this is infinite, / does not have a derivative at X = 0. In particular, this 
shows that a function may be continuous at a point, but not differentiable at 
that point. See Figure 1.6.1. 

Example 1.6.5. If f(x) = y/x, then, as we have seen above, for any x > and 
any nonzero infinitesimal dx, 



Vx 


+ dx — 


\fx 


{Vx + dx - 
(x + dx) 


-Vx) 

— X 


\/x 


+ dx + \/x 
dx 



\Jx + dx + J~x 



\]x + dx + Jx 



V x + dx + J~x 
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It now follows that 

f(x + dx)-f(x) 1 . 1 1 



dx yjx + dx + yfx y/x+^/x 2^/x 

Thus 

/'(*) = -±=. 

2^/x 

For example, the rate of change of y with respect to x when x = 9 is 

/'(9) ' ' 



2^9 6' 
We will sometimes also write 

lf(x) (1.6.6) 

for f'(x). With this notation, we could write the result of the previous example 

as 

d r- 1 

Ix 



dx 2t/x 

Definition 1.6.2. Given a function /, if /'(a) exists we say / is differentiable 
at a. We say / is differentiable on an open interval (a, b) if / is differentiable at 
each point x in (a, b). 

Example 1.6.6. The function y = x 2 is differentiable on (—00, oo). 

Example 1.6.7. The function f(x) = yjx is differentiable on (0, oo). Note that 
/ is not differentiable at x = since /(0 + dx) = f(dx) is not defined for all 
infinitesimals dx. 

Example 1.6.8. The function f(x) = x^ is not differentiable at x = 0. 

1.7 Properties of derivatives 

We will now develop some properties of derivatives with the aim of facilitating 
their calculation for certain general classes of functions. 

To begin, if f(x) = k for all x and some real constant fc, then, for any 
infinitesimal dx, 

f(x + dx)-f(x) = k-k = 0. (1.7.1) 

Hence, if dx ^ 0, 

fix + dx) — f(x) 

— ' ^-^ = 0, (1.7.2) 

dx 

and so fix) = 0. In other words, the derivative of a constant is 0. 
Theorem 1.7.1. For any real constant k, 

— k = 0. (1.7.3) 

dx 

d 
Example 1.7.1. — 4 = 0. 

dx 
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1.7.1 Sums and differences 

Now suppose u and v are both differentiable functions of x. Then, for any 
infinitesimal dx, 

d(u + v) = (u(x + dx) + v(x + dx)) — (u(x) — v(x)) 
= (u(x + dx) — u(x)) + (v(x + dx) — v(x)) 
= du+dv. (1.7.4) 

Hence, if dx ^ 0, 

d{u + v) = du + dv_ 

dx dx dx 

In other words, the derivative of a sum is the sum of the derivatives. 

Theorem 1.7.2. If / and g are both differentiable and s(x) = f(x) + g(x), 
then 

s'(x) = f(x)+g'(x). (1.7.6) 

Example 1.7.2. If y = x 2 + ^/x, then, using our results from the previous 
section, 

dy d , o, d . i— . 1 

:(x 2 ) + -j-Wx) = 2x-' 



dx dx dx 2tJx 

A similar argument shows that 



— (u-v) = —-— (17 7) 

dx dx dx 



Exercise 1.7.1. Find the derivative of y = x 2 + 5. 
Exercise 1.7.2. Find the derivative of f(x) = ^fx — x 2 + 3. 

1.7.2 Constant multiples 

If c is any real constant and u is a differentiable function of x, then, for any 
infinitesimal dx, 

d(cu) = cu(x + dx) — cu(x) = c(u(x + dx) — u(x)) = cdu. (1.7.8) 

Hence, if dx ^ 0, 

d(cu) du 

dx dx 

In other words, the derivative of a constant times a function is the constant 
times the derivative of the function. 

Theorem 1.7.3. If c is a real constant, / is differentiable, and g(x) = cf(x), 
then 

g'(x) = cf'(x). (1.7.10) 
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Example 1.7.3. If y = 5x 2 , then 



du d , , N . 

-* = 5 — (x 2 ) = 5(2z) = 10a;. 

dx ax 



Exercise 1.7.3. Find the derivative of y = 8x 2 . 
Exercise 1.7.4. Find the derivative of f(x) = A^fx + 15. 

1.7.3 Products 

Again suppose u and v are differentiable functions of x. Note that, in partic- 
ular, u and v are continuous, and so both du and dv are infinitesimal for any 
infinitesimal dx. Moreover, note that 

u(x + dx) = u(x) + du and v(x + dx) = v(x) + dv. (1.7.11) 

Hence 

d(uv) = u(x + dx)v(x + dx) — u(x)v(x) 

= (u(x) + du)(v(x) + dv) — u(x)v(x) 

= (u(x)v(x) + u(x)dv + v(x)du + dudv) — u{x)v{x) 

= udv + vdu + dudv , (1.7.12) 



and so, if dx ^ 0, 



d(uv) dv du , dv dv du 

-^— L = u— + v— + du—~u—+v— (1.7.13) 

ax ax ax ax ax ax 

Thus we have, for any differentiable functions u and v, 

d , . dv du , 

dx {uv) = u Tx + v Tx> (L7 - 14) 

which we call the product rule. 

Theorem 1.7.4. If / and g are both differentiable and p(x) = f(x)g(x), then 

p'(x) = f(x)g'(x)+g(x)f(x). (1.7.15) 

Example 1.7.4. We may use the product rule to find a formula for the deriva- 
tive of a positive integer power of x. We first note that if y = x, then, for any 
infinitesimal dx, 

dy = (x + dx) — x = dx, (1.7.16) 

and so, if dx ^ 0, 

dy dx 

dx dx 

Thus we have 

±x = l, (1.7.18) 
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as we should expect, since y = x implies that y changes at exactly the same 
rate as x. 

Using the product rule, it now follows that 

d 2 d d d 

—x = — (x ■ x) = x— x + x— x = x + x = 2x, (1.7.19) 

dx dx dx dx 

in agreement with a previous example. Next, we have 

— x 3 = x—x 2 + x 2 —x = 2x 2 + x 2 = 3x 2 (1.7.20) 

dx dx dx 

and 

—x 4 = x—x 3 + x 3 —x = 3x 3 + x 3 = Ax 3 . (1.7.21) 

dx dx dx 

At this point we might suspect that for any integer n > 1, 

^-x n = nx n - 1 . (1.7.22) 

dx 

This is in fact true, and follows easily from an inductive argument: Suppose we 
have shown that for any k < n, 



dx 

Then 



d x k = kx k -\ (1.7.23) 



±x n = x — x n - 1 +x n - 1 —x 
dx dx dx 

= x((n - l)x n ~ 2 ) + x n ~ 



(1.7.24) 



We call this result the power rule. 
Theorem 1.7.5. For any integer n > 1, 

—x n = nx n - 1 . (1.7.25) 

dx 

We shall see eventually, in Theorems 1.7.7, 1.7.10, and 2.7.2, that the power 
rule in fact holds for any real number n ^ 0. 

Example 1.7.5. When n = 34, the power rule shows that 

—x 34 = 34a: 33 . 
dx 

Example 1.7.6. If f(x) = 14a; 5 , then, combining the power rule with our result 
for constant multiples, 

f'(x) = 14(5a; 4 ) = 70x 4 . 
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Exercise 1.7.5. Find the derivative of y = 13a; 5 . 

Example 1.7.7. Combining the power rule with our results for constant mul- 
tiples and differences, we have 

d . o 

— (3a; — ox) = 6x — 5. 
dx 

Exercise 1.7.6. Find the derivative of f(x) = 5a: 4 — 3a; 2 . 
Exercise 1.7.7. Find the derivative of y = 3a; 7 — 3a; + 1. 

1.7.4 Polynomials 

As the previous examples illustrate, we may put together the above results to 
easily differentiate any polynomial function. That is, if n > 1 and a n , a„_i, 
. . . , ao are any real constants, then 

— (a n x n + a„_ix™ _1 + • • • + a 2 x 2 + a\x + a ) 
ax 

= na n x n ~ x + (n- l)a n _ia;"~ 2 H h 2a 2 x + oi- (1.7.26) 

Example 1.7.8. If p(x) = 4x 7 - 13a; 3 - a; 2 + 21, then 

p{x) = 28a; 6 - 39a; 2 - 2x. 

Exercise 1.7.8. Find the derivative of f(x) = 3a; 5 — 6a; 4 — 5a; 2 + 13. 

1.7.5 Quotients 

If u is a differentiable function of x, u(x) ^ 0, and dx is an infinitesimal, then 



u ) u(x + dx) u(x) 
1 1 



u(x) + du u(x) 
u — (u + du) 
u(u + du) 
—du 



u(u + du) 

Hence, since u + du ~ u, if dx ^ 0, 

du 
d ( 1 \ ~fa. 1 du 



(1.7.27) 



dx \uj u(u + du) u 2 dx 



(1.7.28) 
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Theorem 1.7.6. If / is differentiable, f(x) ^ 0, and 

1 
lip 



9(x) = -^, (1-7.29) 



then 



Example 1.7.9. If 



then 



/'(*) 
(/(*) 



ff{*) = -7T%2- (1-7-30) 



m = ^ 



f'(x) = -- i -2x 
a; 4 



Note that the result of the previous example is the same as we would have 
obtained from applying the power rule with n = —2. In fact, we may now show 
that the power rule holds in general for negative integer powers: If n < is an 
integer, then 

i-* n = i- f-^r) = --^ ■ i-™-"- 1 ) = nxn ~ 1 - a- 7 - 31 ) 

ax ax \ x n J x zn 

Hence we now have our first generalization of the power rule. 
Theorem 1.7.7. For any integer n/0, 

4-x n = nx n ~ 1 . (1.7.32) 

ax 

Example 1.7.10. If 

f(x) = 3a; 2 =, 

x' 

then fix) = 3a; 2 — 5a; -7 , and so 

f'( x ) = 6x + 35a;~ 8 = 6x 



35 

«8 



x° 

Now suppose u and v are both differentiable functions of x and let 

u 

y = -■ 

V 

Then u = vy, so, as we saw above, 

du = vdy + ydv + dvdy = ydv + [v + dv)dy. (1.7.33) 

Hence, provided v(x) ^ 0, 

du - ydv du ~ 7, dv vdu - udv 

dy= -r- = ~, — = — ; r^- 1.7.34 

V + dv v + dv v(v + dv) 
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Thus, for any nonzero infinitesimal dx, 



(1.7.35) 



du dv du dv 

dy V T X - U T X „_ V T X ~ U T X 

dx v(v + dv) v 2 

This is the quotient rule. 
Theorem 1.7.8. If / and g are differentiable, g( X ) ^ 0, and 

*(*) = ff d-7-36) 

then 

,, s 9{x)f'{x) - f{ X )g'( X ) , 17 „m 

q{x) = WiY ■ ( 7) 

One consequence of the quotient rule is that, since we already know how to 
differentiate polynomials, we may now differentiate any rational function easily. 

Example 1.7.11. If 

,, , 3a; 2 -6a; + 4 



then 



/'(*) 



X 2 + 1 



(x 2 + l)(6a: - 6) - (3a; 2 - 6a; + 4) (2a;) 

( X 2 + l) 2 
6a: 3 - 6x 2 + 6a; - 6 - 6a; 3 + 12a; 2 - 8a; 

(x 2 + l) 2 
6a; 2 — 2a; — 6 



(a; 2 + l) 2 ' 
Example 1.7.12. We may use either 1.7.30 or 1.7.37 to differentiate 

V=^- l . d.7.38) 

In either case, we obtain 

dy 5 d , 2 , ^ 10a; 



(x 2 + l) = - (1.7.39) 



da: (x 2 + l) 2 da; v (a; 2 + 1) 



Exercise 1.7.9. Find the derivative of 

14 
4a; 3 — 3a; 

Exercise 1.7.10. Find the derivative of 

4a; 3 -1 
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1.7.6 Composition of functions 

Suppose y is a differentiable function of u and u is a differentiable function of 
x. Then y is both a function of u and a function of x, and so we may ask for 
the derivative of y with respect to x as well as the derivative of y with respect 
to u. Now if dx is an infinitesimal, then 

du = u(x + dx) — u(x) 



is also an infinitesimal (since u is continuous). If du ^ 0, then the derivative of 
y with respect to u is equal to the shadow of -#. At the same time, if dx ^ 0, 
the derivative of u with respect to x is equal to the shadow of 4^ . But 



dy du dy 
du dx dx' 

and the shadow of ^f is the derivative of y with respect to i. It follows that 
the derivative of y with respect to x is the product of the derivative of y with 
respect to u and the derivative of u with respect to x. Of course, du is not 
necessarily nonzero even if dx ^ (for example, if u is a constant function), but 
the result holds nevertheless, although we will not go into the technical details 
here. 

We call this result the chain rule. 

Theorem 1.7.9. If y is a differentiable function of u and u is a differentiable 
function of x, then 

d JL = d JL d Jl. (1 . 7 .4i) 

dx du dx 
Not that if we let y = f{u), u = g{x), and 

K x ) = f °9{x) = f{g{xj), 

then 

dy i dy , du , 

^ = M:E) '^ = /(5(:E)) ' and ^ = 5( " ) - 

Hence we may also express the chain rule in the form 

h'(x) = f(g(x))g'(x). (1.7.42) 

Example 1.7.13. If y = 3u 2 and u = 2x + 1, then 

dy du du , . , . 

— = —— = (6w)(2) = 12u = 24:r+ 12. 

dx du dx 

We may verify this result by first finding y directly in terms of x, namely, 

12a; + 3, 



o 2 

y = 3u -- 


= 3(2 


x+lf = 


3{4:x 2 


+ 4:X + 


1) = 12a; 2 


and then differentiating 


directly: 










dy 
dx 


d , 
= dx^ X 


2 + 12 


x + 3)-- 


= 24a; + 12 
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Note that if we want to evaluate gf when, for example, x = 2, we may either 
evaluate the final form, that is, 

= (24a; + 12)| x=a = 48 + 12 = 60, 



rfy 

rf.T 



or, noting that u = 5 when a; = 2, the intermediate form, that is, 



rf.7' 



I2u\ u=5 = 60. 



In other words, 



rfy 
da: 



z=2 



rfy 
du 



u — 5 



rf.r 



z=2 



Exercise 1.7.11. If y = u 3 + 5 and m = a; 2 — 1, find 



(/;r 



Example 1.7.14. If h(x) = \Jx 2 + 1, then /i(x) = f(g(x)) where f(x) 
and g(x) = a; 2 + 1. Since 



f'(x) 



2^x 



and g'(x) = 2x, 



it follows that 



ft'(x) = f(g(x))g'(x) 



2x 



2y/x 2 + 1 " Vx 2 + 1 ' 



Exercise 1.7.12. Find the derivative of f(x) = y4x + 6. 
Exercise 1.7.13. Find the derivative of y = (x 2 + 5) 10 . 



Example 1.7.15. As we saw in Example 1.2.5, if M is the mass, in grams, of 
a spherical balloon being filled with water and r is the radius of the balloon, in 
centimeters, then 

4 



and 



dM 

dr 



M = -irr grams 



47rr grams/centimeter, 



a result which we may verify easily now using the power rule. Suppose water 
is being pumped into the balloon so that the radius of the balloon is increasing 
at the rate of 0.1 centimeters per second when the balloon has a radius of 10 
centimeters. Since M is a function of time t, as well as a function of the radius 
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of the balloon r, we might wish to know the rate of change of M with respect 
to t. Since we are given that 



dr 
~dt 



= 0.1 centimeters/second, 

r=10 



we may use the chain rule to find that 

= (4007r) (0.1) = 407T grams/second. 



dM 
~df 



dM 
dr 



r=10 



dr 
~dl 



Exercise 1.7.14. Suppose A is the area and r is the radius of a circular wave 
at time t. Suppose when r = 100 centimeters the radius of the circle is increasing 
at a rate of 2 centimeters per second. Find the rate at which the area of the 
circle is growing when r = 100 centimeters. 

Exercise 1.7.15. If water is being pumped into a spherical balloon at the rate 
of 100 grams per second, find the rate of change of the radius r of the balloon 
when the radius of the balloon is r = 15 centimeters. 

As an important special case of the chain rule, suppose n ^ is an integer, 
g is a differentiable function, and h(x) = (g(x)) n . Then h is the composition of 
f(x) = x n with g, and so, using the chain rule, 

h'(x) = f'(g(x))g'(x) = n(0(a;)) n -yOc)- (1.7.43) 

If we let u = g(x), we could also express this result as 

(1.7.44) 

Example 1.7.16. With n = 10 and g(x) = x 1 + 3, we have 

-^-(x 2 + 3) 10 = 10(x 2 + 3) 9 (2a;) = 20a;(x 2 + 3) 9 . 
dx 

Example 1.7.17. If 

«.: 15 



d 




i du 


n 


n- 




U - 


= nu 




dx 




dx 



(x 4 + 5) 2 ' 
then we may apply the previous result with n = — 2 and g(x) = x A + 5 to obtain 

120x 3 



f'(x) = -30(a; 4 + 5)~ 3 (4a; 3 ) 



(x 4 + 5) 3 ' 



We may use the previous result to derive yet another extension to the power 
rule. If n ^ is an integer and y = X" , then y n = x, and so, assuming y is 
differentiable, 
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Hence 



ny n - 1 ^ = l, (1.7.46) 

dx 



from which if follows that 



dv 1 1 1 „ 1 / i \ 1_n lii 

-?/-"=- (a;») = -x--\ (1.7.47) 



da; ny™ 1 n n V / n 

showing that the power rule works for rational powers of the form -. Note that 
the above derivation is not complete since we began with the assumption that 
y = X" is differentiable. Although it is beyond the scope of this text, it may be 
shown that this assumption is justified for x > if n is even, and for all x ^ 
if n is odd. 

Now if m ^ is also an integer, we have, using the chain rule as above, 



d m 


d / i \ m 




— X n 
dx 


= ^K) 






/ A" 1 " 1 1 i 

= m [ x n —x n 


-l 




V / n 






m m - 1 , 1 , 
= — X » " r " 






n 






m m , 

= X " . 






n 





(1.7.48) 

Hence we now see that the power rule holds for any non-zero rational exponent. 
Theorem 1.7.10. If r ^ is any rational number, then 

4-x r = rx r ~ 1 . (1.7.49) 

dx 

Example 1.7.18. With r = ^ in the previous theorem, we have 

d r- 1 _i 1 

'X = -X 2 



dx 2 2-^/x' 

in agreement with our earlier direct computation. 

Example 1.7.19. If y = x» , then 

rfj/ _ 2 _i _ 2 
rfa; 3 3xf ' 

Note that -4*- is not defined at x = 0, in agreement with our earlier result showing 
that y is not differentiable at 0. 

Exercise 1.7.16. Find the derivative of fix) = 5x& . 

We may now generalize 1.7.44 as follows: If u is a differentiable function of 
x and r ^ is a rational number, then 

_ u '' = rw r - 1 — . (1.7.50) 

ace ax 
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Example 1.7.20. If f{x) = Vx 2 + 1, then 

f'(x)= 1 -(x 2 + l)- 1 H2x) 



2 V ' K ' y^Tj- 

Example 1.7.21. If 

9(t) ' 



i 4 + 5' 
then 

g'(t) = (-l)(t 4 + 5)- 2 (4i 3 )= 4/> 



(i 4 + 5) 2 ' 



Exercise 1.7.17. Find the derivative of 

4 

y ~ Vx^Tl' 

Exercise 1.7.18. Find the derivative of f(x ) = (x 2 + 3x-5) 10 (3a; 4 -6x+4) 12 . 

1.7.7 Trigonometric functions 

If y = sin(a;) and w = cos(a;), then, for any infinitesimal dx, 

dy = sin(x + dx) — sin(a;) 

= sin(z) cos(da:) + sin(da;) cos(x) — sin(a;) 

= sin(x)(cos((ia;) — 1) + cos(a;) sin(dir) (1.7.51) 

and 

dw = cos(a; + dx) — cos(x) 

= cos(a;) cos(dx) — sin(a;) sin(da;) — cos(x) 

= cos(a;)(cos((ia;) — 1) — sin(x) sin(efe). (1.7.52) 

Hence, if dx ^ 0, 

dy , .s'm(dx) . , . 1 - cos(dx) ^ oX 

-^-=cos(x) — - — --smi — > — - (1.7.53) 

dx dx dx 

and 

dw , . s'm(dx) , „ 1 — cos(da;) 

= _ s i n ( a; ) — ^ — ^+cos(a;) ^ — i. (1.7.54) 

dx da; dx 

Now from (1.5.13) we know that 

(dx) 2 
< 1 - cos(dx) < - — '—, (1.7.55) 

and so 

1 — cos(a;) dx 

^-iT^T- (1 - 7 ' 56) 
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Hence 



1 — cos(dx) 
dx 
is an infinitesimal. Moreover, from (1.5.36), we know that 



(1.7.57) 



sm(dx) 

— - ~ 1. 1.7.58 

dx 

Hence 

-^ ~ cos(a;)(l) - sin(x)(0) = cos(x) (1.7.59) 

dx 

and 

— ~ - sin(a;)(l) + cos(x)(0) = - sin(a:) (1.7.60) 

dx 

That is, we have shown the following. 
Theorem 1.7.11. For all real values x, 

— — sin(x) = cos(x) (1.7.61) 

dx 

and 

— cos(x) = — sin(a;). (1.7.62) 

dx 

Example 1.7.22. Using the chain rule, 

— cos(4x) = — sin(4x) — (4a;) = — 4sin(4a;). 
dx at 

Example 1.7.23. If /(£) = sin (£), then, again using the chain rule, 

fit) = 2sin(t)— sin(i) = 2sin(i) cos(t). 
at 

Example 1.7.24. If g(x) = cos(x 2 ), then 

g'(x) = — sin(a; )(2x) = — 2xcos(a; ). 
Example 1.7.25. If f(x) = sin (Ax), then, using the chain rule twice, 

f'(x) = 3sin 2 (4x)— sin(4x) = 12sin 2 (4x) cos(4x). 



Exercise 1.7.19. Find the derivatives of 

y = cos(3i + 6) and w = sin 2 (i) cos 2 (4t). 
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Exercise 1.7.20. Verify the following: 

(a) — tan(i) = sec 2 (i) (b) — cot(i) = - csc 2 (t) 

at at 

(c) — sec(i) = sec(i) tan(i) (d) — csc(i) = — csc(i) cot(t) 

Exercise 1.7.21. Find the derivative of y = sec 2 (3£). 
Exercise 1.7.22. Find the derivative of /(£) = tan 2 (3£). 

1.8 A geometric interpretation of the derivative 

Recall that if y = /(x), then, for any real number Ax, 

Ay f(x + Ax)-f(x) 



Ax Ax 



(1.8.1) 



is the average rate of change of y with respect to x over the interval [a;, x + Ax\ 
(see (1.2.7)). Now if the graph of y is a straight line, that is, if /(x) = mx + b 
for some real numbers m and b, then (1.8.1) is m, the slope of the line. In fact, 
a straight line is characterized by the fact that (1.8.1) is the same for any values 
of a; and Ax. Moreover, (1.8.1) remains the same when Ax is infinitesimal; that 
is, the derivative of y with respect to x is the slope of the line. 

For other differentiable functions /, the value of (1.8.1) depends upon both 
x and Ax. However, for infinitesimal values of Ax, the shadow of (1.8.1), that 
is, the derivative -#, depends on x alone. Hence it is reasonable to think of -# 
as the slope of the curve y = f(x) at a point x. Whereas the slope of a straight 
line is constant from point to point, for other differentiable functions the value 
of the slope of the curve will vary from point to point. 

If / is differentiable at a point a, we call the line with slope f'ifl) passing 
through (a,f(a)) the tangent line to the graph of / at (a, /(a)). That is, the 
tangent line to the graph of y = /(x) at x = a is the line with equation 

y=f'(a)(x-a) + f(a). (1.8.2) 

Hence a tangent line to the graph of a function / is a line through a point on 
the graph of / whose slope is equal to the slope of the graph at that point. 

Example 1.8.1. If f(x) = x 5 - 6x 2 + 5, then 

/'(x) = 5x 4 - 12x. 

In particular, /' (—¥) = -jq, and so the equation of the line tangent to the 
graph of / at x = —\ is 

101 ( 1\ 111 
See Figure 1.8.1 
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-20 



Figure 1.8.1: A tangent line to the graph of f(x) = x 5 — Qx 2 + 5 

Exercise 1.8.1. Find an equation for the line tangent to the graph of 

f(x) = 3x 4 -Qx + 3 

at x = 2. 

Exercise 1.8.2. Find an equation for the line tangent to the graph of 

y = 3 sin (x) 

at x = —r . 



1.9 Increasing, decreasing, and local extrema 

Recall that the slope of a line is positive if, and only if, the line rises from left 
to right. That is, if m > 0, f(x) = mx + b, and u < v, then 

f(v) = mv + b 

= mv — mu + mu + b 

= m(v — u) + mu + b 

> mu + b 

= /(«). (1.9.1) 

We should expect that an analogous statement holds for differentiable functions: 
if / is differentiable and f'(x) > for all x in an interval (a, b), then f(v) > f{u) 
for any v > u in (a, b). This is in fact the case, although the inference requires 
establishing a direct connection between slope at a point and the average slope 
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over an interval, or, in terms of rates of change, between the instantaneous 
rate of change at a point and the average rate of change over an interval. The 
mean-value theorem makes this connection. 

1.9.1 The mean-value theorem 

Recall that the extreme value property tells us that a continuous function on a 
closed interval must attain both a minimum and a maximum value. Suppose / 
is continuous on [a, b], differentiable on (a, b), and / attains a maximum value 
at c with a < c < b. In particular, for any infinitesimal dx, /(c) > /(c+ dx), 
and so, equivalently, f(c+ dx) — f(c) < 0. It follows that if dx > 0, 

f(c+dx)-f(c) 

J -— '- J -±-t < 0, (1.9.2) 

dx 

and if dx < 0, 

MM > . (L93) 

dx 
Since both of these values must be infmitesimally close to the same real number, 
it must be the case that 

/(c + ^-/(c)^ 
dx 

That is, we must have /'(c) = 0. A similar result holds if / has a minimum at 
c, and so we have the following basic result. 

Theorem 1.9.1. If / is differentiable on (a, b) and attains a maximum, or a 
minimum, value at c, then /'(c) = 0. 

Now suppose / is continuous on [a, b], differentiable on (a, b), and /(a) = 
f(b). If / is a constant function, then /'(c) = for all c in (a, b). If / is not 
constant, then there is a point c in (a, b) at which / attains either a maximum 
or a minimum value, and so /'(c) = 0. In either case, we have the following 
result, known as Rolle's theorem. 

Theorem 1.9.2. If / is continuous on [a, 6], differentiable on (a, 6), and /(a) = 
f(b), then there is a real number c in (a, b) for which /'(c) = 0. 

More generally, suppose / is continuous on [a, b] and differentiable on (a, b). 
Let 

g(x) = f(x)- m ~ f{a \ x-a)-f(a). (1.9.5) 

o — a 

Note that g(x) is the difference between f(x) and the corresponding y value 
on the line passing through (a,f(a)) and (b,f(b)). Moroever, g is continuous 
on [a, b], differentiable on (o, 6), and g(a) = = g(b). Hence Rolle's theorem 
applies to g, so there must exist a point c in (a, 6) for which </(c) = 0. Now 

g(c) = f(x) , (1.9.6) 
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Figure 1.9.1: Graph of f(x) 



3x + 1 with its tangent line at x 



so we must have 



That is, 



= </(c) = /'(c) 



f'(c) 



(c)- 


/(&) - /(«) 


6 — a 


/(&)- 


"/(«) 



b — a 



(1.9.7) 



(1.9.8) 



which is our desired connection between instantaneous and average rates of 
change, known as the mean-value theorem. 

Theorem 1.9.3. If / is continuous on [a, b] and differentiable on (a, b), then 
there exists a real number c in (a, b) for which 



/'(c) 



Hb)-f(a)) 

b — a 



(1.9.9) 



Example 1.9.1. Consider the function f(x) = x 3 — 3x + 1 on the interval [0, 2]. 
By the mean- value theorem, there must exist at least one point c in [0, 2] for 
which 

//(c) = /(2)-/(0) = 3-l =1 

w 2-0 2 

Now f'(x) = 3a; 2 - 3, so /'(c) = 1 implies 3c 2 - 3 = 1. Hence c = J\. Note 

that this implies that the tangent line to the graph of / at x = •» / 1 is parallel 
to the line through the endpoints of the graph of /, that is, the points (0, 1) and 
(2,3). See Figure 1.9.1. 



1.9. INCREASING, DECREASING, AND LOCAL EXTREMA 43 

1.9.2 Increasing and decreasing functions 

The preceding discussion leads us to the following definition and theorem. 

Definition 1.9.1. We say a function / is increasing on an interval I if, whenever 
a < b are points in /, /(a) < f(b). Similarly, we say / is decreasing on I if, 
whenever a < b are points in /, /(a) > /(&). 

Now suppose / is a defined on an interval / and f'(x) > for every x in I 
which is not an endpoint of /. Then given any a and b in /, by the mean- value 
theorem there exists a point c between a and b for which 

/(& ;- /(a) = /'(c) > 0. (1.9.10) 

b — a 

Since b — a > 0, this implies that /(&) > f(a). Hence / is increasing on /. A 
similar argument shows that / is decreasing on / if f'(x) < for every x in / 
which is not an endpoint of I. 

Theorem 1.9.4. Suppose / is defined on an interval /. If f'(x) > for every 
x in I which is not an endpoint of /, then / is increasing on I. If /'(en) < for 
every x in / which is not an endpoint of J, then / is decreasing on /. 

Example 1.9.2. Let f(x) = 2x 3 - 3x 2 - Ylx + 1. Then 

f(x) = 6x 2 - 6a; - 12 = 6{x 2 - x - 2) = 6(x - 2){x + 1). 

Hence f'(x) = when x = — 1 and when x = 2. Now x — 2 < for x < 2 
and x — 2 > for x > 2 , while x + 1 < for x < — 1 and x + 1 > when 
x > —I. Thus f'(x) > when x < — 1 and when ir > 2, and /'(a;) < when 
— 1 < x < 2. It follows that / is increasing on the intervals (— oo,— 1) and 
(2,oo), and decreasing on the interval (—1,2). 

Note that the theorem requires only that we know the sign of /' at points 
inside a given interval, not at the endpoints. Hence it actually allows us to make 
the slightly stronger statement that / is increasing on the intervals (— oo,— 1] 
and [2,oo), and decreasing on the interval [—1,2]. 

Since / is increasing on (— oo, —1] and decreasing on [—1, 2], the point ( — 1, 8) 
must be a high point on the graph of /, although not necessarily the highest 
point on the graph. We say that / has a local maximum of 8 at x = — 1. 
Similarly, / is decreasing on [—1,2] and increasing on [2,oo), and so the point 
(2, —19) must be a low point on the graph of /, although, again, not necessarily 
the lowest point on the graph. We say that / has a local minimum of —19 at 
x = 2. From this information, we can begin to see why the graph of / looks as 
it does in Figure 1.9.2. 

Definition 1.9.2. We say / has a local maximum at a point c if there exists an 
interval (a, b) containing c for which /(c) > f(x) for all x in (a, b). Similarly, we 
say / has a local minimum at a point c if there exists an interval (a, b) containing 
c for which /(c) < f{x) for all x in (a, 6). We say / has a local extremum at c 
if / has either a local maximum or a local minimum at c. 
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Figure 1.9.2: Graph of f(x) = 2x 3 - 3x 2 - Ylx + 1 



We may now rephrase Theorem 1.9.1 as follows. 

Theorem 1.9.5. If / is differentiable at c and has a local extremum at c, then 
f(c) = 0. 

As illustrated in the preceding example, we may identify local minimums of 
a function / by locating those points at which / changes from decreasing to 
increasing, and local maximums by locating those points at which / changes 
from increasing to decreasing. 

Example 1.9.3. Let f(x) = x + 2sin(x). Then f'(x) = 1 + 2cos(x), and so 
f'{x) < when, and only when, 

/ \ l 
cos(a;j < . 



For < x < 27r, this occurs when, and only when, 

2tt An 

— < x < — . 

3 3 

Since the cosine function has period 2tt, if follows that f'(x) < when, and 
only when, x is in an interval of the form 

27T 47T 

h 2irn, h 27m 

3 '3 

for n = 0, ±1,±2, . . .. Hence / is decreasing on these intervals and increasing 
on intervals of the form 

2tt 2tt 

h 27m, h 27m 

3 3 
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10 — 



y = x + 2 sin(x) 



-10 



10 



-10 — 



Figure 1.9.3: Graph of f(x) = x + 2 sin(a?) 



n = 0, ±1, ±2, .... It now follows that / has a local maximum at every point of 

the form 

2tt 

x = h 2nn 

3 

and a local minimum at every point of the form 

4-7T 

x = h 27m. 

3 

From this information, we can begin to see why the graph of / looks as it does 
in Figure 1.9.3. 

Exercise 1.9.1. Find the intervals where f(x) = x 3 — 6x is increasing and 
the intervals where / is decreasing. Use this information to identify any local 
maximums or local minimums of /. 

Exercise 1.9.2. Find the intervals where f{x) = 5a; 3 — 3a; 5 is increasing and 
the intervals where / is decreasing. Use this information to identify any local 
maximums or local minimums of /. 

Exercise 1.9.3. Find the intervals where f(x) = x + sin(a;) is increasing and 
the intervals where / is decreasing. Use this information to identify any local 
maximums or local minimums /. 



1.10 Optimization 

Optimization problems, that is, problems in which we seek to find the greatest or 
smallest value of some quantity, are common in the applications of mathematics. 
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Because of the extreme-value property, there is a straightforward algorithm 
for solving optimization problems involving continuous functions on closed and 
bounded intervals. Hence we will treat this case first before considering functions 
on other intervals. 

Recall that if /(c) is the maximum, or minimum, value of / on some interval 
/ and / is differentiable at c, then /'(c) = 0. Consequently, points at which the 
derivative vanishes will play an important role in our work on optimization. 

Definition 1.10.1. We call a real number c where /'(c) = a stationary point 
of /. 

1.10.1 Optimization on a closed interval 

Suppose / is a continuous function on a closed and bounded interval [a, b]. By 
the extreme- value property, / attains a maximum, as well as a minimum value, 
on [a,b\. In particular, there is a real number c in [a,b] such that /(c) > f(x) 
for all x in [a, b]. If c is in (a, b) and / is differentiable at c, then we must have 
/'(c) = 0. The only other possibilities are that / is not differentiable at c, c = a, 
or c = b. Similar comments hold for points at which a minimum value occurs. 

Definition 1.10.2. We call a real number c a singular point of a function / if 
/ is defined on an open interval containing c, but is not differentiable at c. 

Theorem 1.10.1. If / is a continuous function on a closed and bounded interval 
[a, b], then the maximum and minimum values of / occur at either (1) stationary 
points in the open interval (a, 6), (2) singular points in the open interval (a, b), 
or (3) the endpoints of [a, b]. 

Hence we have the following procedure for optimizing a continuous function 
/ on an interval [a, b]: 

(1) Find all stationary and singular points of / in the open interval (a, b). 

(2) Evaluate / at all stationary and singular points of (a,b), and at the end- 
points a and b. 

(3) The maximum value of / is the largest value found in step (2) and the 
minimum value of / is the smallest value found in step (2). 

Example 1.10.1. Consider the function g(t) = t — 2cos(£) defined on the 
interval [0, 2w]. Then 

</(£) = l + 2sin(i), 

and so g'(t) = when 

1 
sm(i) = --. 

For t in the open interval (0, 271"), this means that either 

7n 
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Figure 1.10.1: Graph of g(t) = t — 2cos(£) on the interval [0, 2ir] 



or 



t 



IItt 
~6~' 



That is, the stationary points of g in (0,27r) are -^ and -#. Note that g is 
differentiable at all points in (0, 271"), and so there are no singular points of g in 
(0, 2tt). Hence to identify the extreme values of g we need evaluate only 



7tt 



IItt 



and 

5 (2tt) = 2tt - 2 ss 4.28319. 

Thus g has a maximum value of 5.39724 at t = -^ and a minimum value of —2 
at £ = 0. See Figure 1.10.1 for the graph of g on [0, 271"]. 



5(0) = -2, 




77T r- 

— + V3s, 
6 


5.39724, 


IItt r 
— -V3 


« 4.02753 



Exercise 1.10.1. Find the maximum and minimum values of 

/(x)=* 2 + - 
x 

on the interval [1,4]. 



Exercise 1.10.2. Find the maximum and minimum values of g(t) = t — sin(2£) 
on the interval [0,7r]. 
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Example 1.10.2. Suppose we inscribe a rectangle R inside the ellipse E with 
equation 

Ax 2 + y 2 = 16, 

as shown in Figure 1.10.2. If we let (x, y) be the coordinates of the upper 
right-hand corner of R, then the area of R is 

A={2x){2y)=Axy. 

Since (x, y) is a point on the upper half of the ellipse, we have 

y = \J\§-Ax 2 = 2\/A -x 2 , 

and so 

A = 8x\/A - x 2 . 

Now suppose we wish to find the dimensions of R which maximize its area. That 
is, we want to find the maximum value of A on the interval [0, 2]. Now 

dA -2x /- -8a: 2 + 8(4 -a; 2 ) 32 - 16a: 2 

— = 8a; ; + 8V4 - x 2 = ; v = , 

dx 2^4 - x 2 V4-x 2 V<±-x 2 

Hence ^ = 0, for x in (0, 2), when 32 - 16a: 2 = 0, that is, when x = y/2. Thus 
the maximum value of A must occur at x = 0, x = \/2, or x = 2. Evaluating, 
we have 

A\ x=V z = 8V2V2 = W, 

and 

A\ x=2 = 0. 

Hence the rectangle R inscribed in E with the largest area has area 16 when 
x = \/2 and y = 2\/2. That is, R is 2^2 by Ay/2. 

Exercise 1.10.3. Find the dimensions of the rectangle R with largest area 
which may be inscribed in the ellipse with equation 

2 9 

a 2 b 2 ' 

where a and b are positive real numbers. 

Exercise 1.10.4. A piece of wire, 100 centimeters in length, is cut into two 
pieces, one of which is used to form a square and the other a circle. Find the 
lengths of the pieces so that sum of the areas of the square and the circle are 
(a) maximum and (b) minimum. 

Exercise 1.10.5. Show that of all rectangles of a given perimeter P, the 
square is the one with the largest area. 
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5^ 



4x 2 + y 2 = 1G 




-5- 



Figure 1.10.2: A rectangle inscribed in the ellipse 4x 2 



1(3 



1.10.2 Optimization on other intervals 

We now consider the case of a continuous function / on an interval I which is 
either not closed or not bounded. The extreme-value property does not apply 
in this case, and, as we have seen, we have no guarantee that / has an extreme 
value on the interval. Hence, in general, this situation requires more careful 
analysis than that of the previous section. 

However, there is one case which arises frequently and which is capable of 
a simple analysis. Suppose that c is a point in / which is either a stationary 
or singular point of /, and that / is differentiable at all other points of /. If 
f'(x) < for all a: in J with x < c and f'(x) > for all x in I with c < x, then 
/ is decreasing before c and increasing after c, and so must have a minimum 
value at c. Similarly, if f'(x) > for all x in I with x < c and fix) < for all 
X in I with c < x, then / is increasing before c and decreasing after c, and so 
must have a maximum value at c. The next examples will illustrate. 

Example 1.10.3. Consider the problem of finding the extreme values of 



y = Ax 2 + 



1 



on the interval (0,oo). Since 

^ = 8x - — 
dx x 2 

we see that -£ < when, and only when, 



8a; < 



.-)() 
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Figure 1.10.3: Graph of y = Ax 2 



This is equivalent to 



x° < 



so gj < on (0,oo) when, and only when, < x < J. Similarly, we see that 
-¥- > when, and only when, x > \. Thus y is a decreasing function of x on 
the interval (0, ^) and an increasing function of x on the interval (i,oo), and 
so must have an minimum value at x = \. Note, however, that y does not have 
a maximum value: given any x = c, if c < | we may find a larger value for y by 
using any < x < c, and if c > | we may find a larger value for y by using any 
x > c. Thus we conclude that y has a minimum value of 3 at x = |, but does 
not have a maximum value. See Figure 1.10.3. 



Example 1.10.4. Consider the problem of finding the shortest distance from 
the point A = (0, 1) to the parabola P with equation y = x 2 . If (x, y) is a point 
on P (see Figure 1.10.4), then the distance from A to (x,y) is 

D = V(x - 0) 2 + {y - l) 2 = Vx 2 + (x 2 - l) 2 . 

Our problem then is to find the minimum value of D on the interval (— oo, oo). 
However, to make the problem somewhat easier to work with, we note that, 
since D is always a positive value, finding the minimum value of D is equivalent 
to finding the minimum value of D 2 . So letting 



x 2 + {x 2 - l) 2 = x 2 + x 4 - 2x 2 



1 = x 4 - x 2 + 1, 



z = D z 

our problem becomes that of finding the minimum value of z on (— oo, oo). Now 

dz 



dx 



\x" - 2x, 
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Figure 1.10.4: Distance from A = (0, 1) to a point (x, y) on the graph of y 



so ^| = when, and only when, 

= 4x 3 - 2x = 2x(2x 2 - 1), 

that is, when, and only when, x = — i=, x = 0, or x = -4=. Now 2x < 

when — oo < x < and 2a; > when < x < oo, whereas 2a; 2 — 1 < when 
— j= < x < —7= and 2a; 2 — 1 > either when x < — j= or when x > -hg . Taking 

the product of 2a; and 2a; 2 — 1, we see that 4^ < when x < — j= and when 

< x < 4=, and gf > when —4= < a; < and when x > 4=. It follows 

that z is a decreasing function of x on I — oo, — j=) and on ( 0, -^? J , and is an 

increasing function of x on I — h- , J and on l-j-,oo). 



It now follows that z has a local minimum of | at x 



of 1 at 



—7= , a local maximum 

v2 

x = U, ana anotner local minimum ol | at x = -4-. Note that | is 
the minimum value of z both on the the interval (— oo,0) and on the interval 
(0, oo); since z has a local maximum of 1 at x = 0, it follows that | is in fact the 
minimum value of z on (—00,00). Hence we may conclude that the minimum 

distance from A to P is ^, and the points on P closest to A are I — 751 5 ) an d 

I -t=, i). Note, however, that z does not have a maximum value, even though 
it has a local maximum value at x = 0. See Figure 1.10.5 for the graph of z. 



Exercise 1.10.6. Find the point on the parabola y = x 2 which is closest to 
the point (3, 0). 
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Figure 1.10.5: Graph of z = x 4 



Exercise 1.10.7. Show that of all rectangles of a given area A, the square is 
the one with the shortest perimeter. 

Exercise 1.10.8. Show that of all right circular cylinders with a fixed volume 
V, the one with with height and diameter equal has the minimum surface area. 



Exercise 1.10.9. Find the points on the ellipse 4x 2 
closest to and (b) farthest from the point (0, 1). 



y 



16 which are (a) 



1.11 Implicit differentiation and rates of change 

Many curves of interest are not the graphs of functions. For example, for con- 
stants a > and b > 0, the equation 



2 2 

x y 
^ 2+ ¥ 



1 



(1.11.1) 



describes an ellipse E which intersects the x-axis at (— a,0) and (a,0) and the 
y-axis at (— &, 0) and (6,0) (see Figure 1.11.1). The ellipse E is not the graph 
of a function since, for any —a < x < a, both 



I x, \/a 2 — x 2 I 

V a J 



(1.11.2) 



and 



- V a 2 — x 2 



(1.11.3) 
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Figure 1.11.1: The ellipse 



IT. 
I, 2 



lie on E. Nevertheless, we expect that E will have a tangent line at every point. 
Note, however, that the tangent lines at (— a,0) and (a,0) are vertical lines, and 
so do not have a slope. At all other points, we may find the slope of the tangent 
line by treating y in (1.11.1) as a function of x, differentiating both sides of the 
equation with respect to x, and solving for -p. In general, given a function / 
of x and y and a constant c, this technique will work to find the slope of the 
curve defined by an equation f(x,y) = c. Note that in applying this technique 
we are assuming that y is differentiable. This is in fact true for a wide range 
of relationships defined by /, but the technical details are beyond the scope of 
this text. 



Example 1.11.1. Using a = 2 and b 
both sides of the equation by 4, 



1, (1.11.1) becomes, after multiplying 



■V 



4. 



Differentiating both sides of this equation by x, and remembering to use the 
chain rule when differentiating y 2 , we obtain 

dy 
2x + 8y— = 0. 
ax 



Solving for — 



dx 



we have 



dy 
dx 



x 



which is defined whenever y ^ (corresponding to the points (—2, 0) and (2,0), 
at which, as we saw above, the slope of the tangent lines is undefined). For 
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1 i ,u^ 




-2- 



V = i 



Figure 1.11.2: The ellipse x 2 + 4y 2 = 4 with tangent line at M , ^ J 



example, we have 



dx 



(x,j/) = (l,#) 2v^' 

and so the equation of the line tangent to the ellipse at the point I lj 2 ) 

73 



-^M {x ~ l)+ 2 



See Figure 1.11.2. 



Example 1.11.2. Consider the hyperbola H with equation 

x 2 - Axy + y 2 = 4. 

Differentiating both sides of the equation, remembering to treat y as a function 
of x, we have 

dy dy 

2x - Ax— -Ay + 2y — = 0. 
dx dx 



Solving for -#, we see that 



dy Ay — 2x 2y — x 





dx 


2y — Ax y — 2x 


example, 


dy 


2 1 




dx 


(x,y) = (2fl) 4 2 
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Figure 1.11.3: The hyperbola x 2 — Axy + y 2 = 4 with tangent line at (2, 0) 

Hence the equation of the line tangent to H at (2, 0) is 

1 



V 



-(x-2). 



See Figure 1.11.3. 



Exercise 1.11.1. Find 



Exercise 1.11.2. Find 



'III 

(l.V, 



dx 



if y 2 + 8xy - x 2 



10. 



(x,y) = (2-l) 



if x 2 y + 3xy - Yly = 2. 



Exercise 1.11.3. Find the equation of the line tangent to the circle with equa- 
tion x 2 + y 2 = 25 at the point (3,4). 

Exercise 1.11.4. Find the equation of the line tangent to the ellipse with 
equation x 2 + xy + y 2 = 19 at the point (2, 3). 

The technique described above, known as implicit differentiation, is also 
useful in finding rates of change for variables related by an equation. The next 
examples illustrate this idea, with the first being similar to examples we saw 
earlier while discussing the chain rule. 

Example 1.11.3. Suppose oil is being poured onto the surface of a calm body 
of water. As the oil spreads out, it forms a right circular cylinder whose volume 
is 

V = Trr 2 h, 

where r and h are, respectively, the radius and height of the cylinder. Now 
suppose the oil is being poured out at a rate of 10 cubic centimeters per second 



.->(> 
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Figure 1.11.4: Ships A and B passing a point P 

and that the height remains a constant 0.25 centimeters. Then the volume of 
the cylinder is increasing at a rate of 10 cubic centimeters per second, so 



dV 

~dt 



10 cm /sec 



at any time t. Now with h = 0.25, 



V = 0.257rr 2 , 



SO 



Hence 



dV 
~dt 



1 dr 
—Trr— -. 

2 dt 



20 



cm/sec. 



dr _ 2 dV 
dt irr dt irr 
For example, if r = 10 centimeters at some time t = to, then 



dr 

~di, 



20 
IOtt 



0.6366 cm/sec. 



Example 1.11.4. Suppose ship A, headed due north at 20 miles per hour, and 
ship B, headed due east at 30 miles per hour, both pass through the same point 
P in the ocean, ship A at noon and ship B two hours later (see Figure 1.11.4). 
If we let x denote the distance from A to P t hours after noon, y denote the 
distance from B to P t hours after noon, and z denote the distance from A to 
B t hours after noon, then, by the Pythagorean theorem, 



y 
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Differentiating this equation with respect to t, we find 

dz dx dy 

2z— = 2x— + 2y-^-, 
dt dt dt ' 

or 

dz dx dy 

z — = x h y — . 

dt dt dt 

For example, at 4 in the afternoon, that is, when t = 4, we know that 

x= (4) (20) = 80 miles, 

y= (2) (30) = 60 miles, 

and 

z = \/80 2 + 60 2 = 100 miles, 

so 

dz dx dy 

100— = 80— + 60-^ miles/hour. 

dt dt dt ' 



Since at any time t, 



and 



we have 

dz 

dt 



dx 

— = 20 miles/hour 
dt ' 

— = 30 miles/hour, 



(80)(20) + (60)(30) „, ., .. 

^ — - — '- i — - — '- = 34 miles/hour. 

100 ' 



Exercise 1.11.5. Suppose the volume of a cube is growing at a rate of 150 
cubic centimeters per second. Find the rate at which the length of a side of the 
cube is growing when each side of the cube is 10 centimeters. 

Exercise 1.11.6. A plane flies over a point P on the surface of the earth at 
a height of 4 miles. Find the rate of change of the distance between P and the 
plane one minute later if the plane is traveling at 300 miles per hour. 

Exercise 1.11.7. Suppose the length of a rectangle is growing at a rate of 2 
centimeters per second and its width is growing at a rate of 4 centimeters per 
second. Find the rate of change of the area of the rectangle when the length is 
10 centimeters and the width is 12 centimeters. 

1.12 Higher-order derivatives 

Given two quantities, y and x, with y a function of a;, we know that the derivative 
-P- is the rate of change of y with respect to x. Since -^ is then itself a function 
of a;, we may ask for its rate of change with respect to x, which we call the 
second-order derivative of y with respect to x and denote -j-|. 
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Example 1.12.1. If y = 4x 5 - 3x 2 + 4, then 

— = 20x 4 - 6x, 
dx 



and so 



d 2 y o 

_| = 80x 3 - 6. 
dx z 



Of course, we could continue to differentiate: the third derivative of y with 
respect to x is 

i. — 24Dr 2 

(to 3 " ^^ ' 

the fourth derivative of y with respect to x is 

d 4 y 

and so on. 

If y is a function of a; with y = f(x), then we may also denote the second 
derivative of y with respect to x by f"(x), the third derivative by f'"(x), and 
so on. The prime notation becomes cumbersome after awhile, and so we may 
replace the primes with the corresponding number in parentheses; that is, we 
may write, for example, f""(x) as f^ A '{x). 

Example 1.12.2. If 



then 



/(*) = 


1 

X 


/'(*) = 


1 

X z 


/"(*) = 


2 
a; 3 ' 


f'"{x) = 


6 

~x^ 


f {A \x)- 


24 

X 5 



and 



Exercise 1.12.1. Find the first, second, and third-order derivatives of y 
sin(2x). 

Exercise 1.12.2. Find the first, second, and third-order derivatives of f(x) 

VAxTi 
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1.12.1 Acceleration 

If x is the position, at time t, of an object moving along a straight line, then we 
know that 

-I (L12 ' 1 » 

is the velocity of the object at time t. Since acceleration is the rate of change 
of velocity, it follows that the acceleration of the object is 

dv d x 

Example 1.12.3. Suppose an object, such as a lead ball, is dropped from a 
height of 100 meters. Ignoring air resistance, the height of the ball above the 
earth after t seconds is given by 

x (t) = 100 - AM 2 meters, 

as we discussed in Section 1.2. Hence the velocity of the object after t seconds 
is 

v(t) = — 9.8t meters/second 

and the acceleration of the object is 

a(t) = —9.8 meters/second . 

Thus the acceleration of an object in free-fall near the surface of the earth, ignor- 
ing air resistance, is constant. Historically, Galileo started with this observation 
about acceleration of objects in free-fall and worked in the other direction to 
discover the formulas for velocity and position. 

Exercise 1.12.3. Suppose an object oscillating at the end of a spring has 
position x = 10cos(7r£) (measured in centimeters from the equilibrium position) 
at time t seconds. Find the acceleration of the object at time t = 1.25. 

1.12.2 Concavity 

The second derivative of a function / tells us the rate at which the slope of 
the graph of / is changing. Geometrically, this translates into measuring the 
concavity of the graph of the function. 

Definition 1.12.1. We say the graph of a function / is concave upward on an 
open interval (a, b) if /' is an increasing function on (a, b). We say the graph of 
a function / is concave downward on an open interval (a, b) if /' is a decreasing 
function on (a,b). 

To determine the concavity of the graph of a function /, we need to determine 
the intervals on which /' is increasing and the intervals on which /' is decreasing. 
Hence, from our earlier work, we need identify when the derivative of /' is 
positive and when it is negative. 



00 
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Figure 1.12.1: Graph of f(x) = 2x 3 - 3x 2 - 12x + 1 



Theorem 1.12.1. If / is twice differentiable on (a, 6), then the graph of / is 
concave upward on (a, b) if f"(x) > for all x in (a, &), and concave downward 
on (a, b) if /'' ' (x) < for all x in (o, b). 

Example 1.12.4. If f(x) = 2x 3 - 3x 2 - Ylx + 1, then 

f'(x) = 6x 2 - 6a; - 12 

and 

f"(x) = Ylx - 6. 

Hence f"{x) < when x < | and f"(x) > when x > |, and so the graph 
of / is concave downward on the interval (-co, |) and concave upward on the 
interval (i oo). One may see the distinction between concave downward and 
concave upward very clearly in the graph of / shown in Figure 1.12.1. 

We call a point on the graph of a function / at which the concavity changes, 
either from upward to downward or from downward to upward, a point of in- 



flection. In the previous example, (^, — g") is a point of inflection. 



Exercise 1.12.4. Find the intervals on which the graph of f(x) = 5x 3 — 3x 5 
is concave upward and the intervals on which the graph is concave downward. 
What are the points of inflection? 



1.12.3 The second- derivative test 

Suppose c is a stationary point of / and /"(c) > 0. Then, since /" is the 
derivative of /' and /'(c) = 0, for any infinitesimal dx ^ 0, 

f(c + dx)-f(c) f(c+dx) 



dx 



dx 



>0. 



(1.12.3) 
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Figure 1.12.2: Graph of f{x) 



It follows that fie + dx) > when dx > and /' (c + dx) < when cfe < 0. 
Hence / is decreasing to the left of c and increasing to the right of c, and so / 
has a local minimum at c. Similarly, if /"(c) < at a stationary point c, then 
/ has a local maximum at c. This result is the second- derivative test. 



Example 1.12.5. If f{x) 



then 



f(x) = 4x 3 



3aT 



x 2 (Ax - 3) 



and 



f'[x) = 12a; 2 -6x = 6x(2x - 1) 



Hence / has stationary points x = and x = f . Since 



/"(0) = 



and 



/" 



>0, 



we see that / has a local minimum at x = f. Although the second derivative 
test tells us nothing about the nature of the critical point x = 0, we know, since 
/ has a local minimum at x = |, that / is decreasing on (0, |) and increasing 
on (|, oo ) . Moreover, since 4x — 3 < for all x < 0, it follows that /'(a;) < for 
all x < 0, and so / is also decreasing on (— oo,0). Hence / has neither a local 
maximum nor a local minimum at x = 0. Finally, since f"(x) < for < x < \ 
and fix) > for all other x, we see that the graph of / is concave downward 
on the interval (0, 2) and concave upward on the intervals (—00, 0) and (2, 00). 
See Figure 1.12.2. 
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Exercise 1.12.5. Use the second-derivative test to find all local maximums 
and minimums of 

f(x) = x+-. 



Exercise 1.12.6. Find all local maximums and minimums of g(t) = 5t 7 — 7i E 



Chapter 2 

Integrals 



2.1 Integrals 

We now turn our attention to the other side of Zeno's arrow paradox. In the 
previous chapter we began with the problem of finding the velocity of an object 
given a function which defined the position of the object at every instant of 
time. We now suppose that we are given a function which specifies the velocity 
v of an object, moving along a straight line, at every instant of time t, and we 
wish to find the position x of the object at time t. There are two approaches 
to finding x; we will investigate both, leading us to the fundamental theorem of 
calculus. 

First, from our earlier work we know that v is the derivative of x. That is, 

dx 

Tt= V - (2 - L1) 

Hence to find x we need to find a function which has v for its derivative. 

Definition 2.1.1. Given a function / defined on an open interval (a, b), we call 
a function F an integral of / if F'(x) = f(x) for all x in (a, b). 

Example 2.1.1. If f(x) = 3a; 2 , then F{x) = x 3, is an integral of / on (—00, oo) 
since F'(x) = 3a; 2 for all x. However, note that F is not the only integral of /: 
for other examples, both G{x) = a; 3 + 4 and H(x) = x 3 + 15 are integrals of / as 
well. Indeed, since the derivative of a constant is the function L[x) = x 3 + c 
is an integral of / for any constant c. 

In general, if F is an integral of /, then G{x) = F{x) + c is also an integral 
of / for any constant c. Are there any other integrals of /? That is, if we start 
with both F and G being integrals of /, does it follow that G(x) — F(x) = c for 
some constant c and for all x? To answer this question, first note that if we let 
H(x) = G(x)-F(x), then 

H'{x) = G'{x) - F'{x) = f(x) - f(x) = (2.1.2) 

63 
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for all x. Hence H is an integral of the constant function g(x) = for all x. So 
our question becomes: If H'(x) = for all x, does it follow that H(x) = c for 
some constant c and all x? If it does, then 

c = H(x)=F(x)-G(x), (2.1.3) 

and indeed F and G differ by only a constant. So suppose we are given H'(x) = 
for all x in an open interval (a, 6). Then, for any two points u < v in (a, 6), it 
follows from the mean-value theorem that 

H{v)-H{u) = H'{d){u-v) (2.1.4) 

for some d in (a,b). But then H'(d) = 0, so -ff(f) — ff(w) = 0, that is, H{u) = 
H(v). Since this is true for any arbitrary points u and v in (a, b), it follows that 
H must be constant on (a, 6). 

Theorem 2.1.1. If F'(x) = G'(x) for all x in (a, 6), then there exists a constant 
c such that G(a;) = F(x) + c for all x in (a, 6). 

In particular, if F'[x) = for all x in (a, b), then i 7, is constant on (a, 6). 

Example 2.1.2. Since 

d /3 o 

x 1 +Ax) = 3a; + 4, 



da; \2 

any integral of /(a;) = 3x + 4 must be of the form 

3 

F(x) = -x 2 +Ax + c 

for some constant c. 

We denote an integral of a function / by 

f(x)dx. (2.1.5) 

The motivation for this notation will be more evident once we discuss the fun- 
damental theorem of calculus. 

Example 2.1.3. Since 

— (4a; 3 — sin(x)) = 12a: 2 — cos(a;), 
dx 

it follows that 

(12a; — cos(x))dx = Ax — sin(a;) + c, 

where, as before, c is some constant. 
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From our rules for differentiation, it follows easily that 

1 

n+ 1 



x n dx = — — x n+1 + c (2.1.6) 



for every rational ii^-1, and 

sin(x)dx = — cos(x) + c, (2-1-7) 

cos(x)dx = sin(x) + c, (2.1.8) 

sec (x)dx = tan(x) + c, (2.1.9) 

csc 2 (x)dx = — csc(x) + c, (2.1.10) 

/sec(.)tan(^ = sec (a; ) + c, (2.1.11) 



and 



csc(a;) cot(x)dx = — csc(a;) + c, (2.1.12) 



where in each case c represents an arbitrary constant. Note that differentiation 
of the right-hand side of each of the above verifies these statements. Moreover, 
if follows from our work with derivatives that for any functions / and g and any 
constant k, 



(f(x) + g(x))dx = J f(x)dx + J g(x)dx, (2.1.13) 

(f(x)-g(x))dx= f f(x)dx- I ' g{x)dx, (2.1.14) 



and 



Example 2.1.4. 



Example 2.1.5. 



kf{x)dx = k f{x)dx. (2.1.15) 



5 

(5a; 3 -6x + 8)dx= -x 4 - 3x 2 + 8x + c. 

v ' 4 



(sin(x) — 4cos(x))rfa; = — cos(a;) — 4sin(x) + c. 
Example 2.1.6. Making an adjustment for the chain rule, we see that 



f ! 

/ sm(5x)dx = cos(5a;) + c. 

J 5 



(i(i 
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Figure 2.1.1: Parallel curves y = |a; 3 — 7x + c 

Example 2.1.7. Suppose we wish to find the integral F{x) of fix) = 5x 2 — 7 
for which F(l) = 10. Now 



(5x 2 — 7)dx = -x 3 — 7x + c, 
o 



M» 



F(x) = -x 3 -7x + c 



for some constant c. Now we want 



10 = F(l) = --7 + c, 



so we must have 



c= 10 + 7- 



5 _ 46 

3 ~ y 



Hence the desired integral is 



n/ N 5 3 , 46 

J F(x) = -x 3 -7s+y. 

Note that, geometrically, from the family of parallel curves with equations of 
the form y = |x 3 — 7x + c, we are finding the one that passes through the point 
(1, 10). Figure 2.1.1 shows five such curves, with the graph of F in blue. 



Exercise 2.1.1. Evaluate each of the following: 

1 



(a) / (a; + 3)dx 



(b) / -j dx 



(c) / (3sin(ir) — 5 sec(x) ta,n(x))dx (d) I A\fx dx 
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Exercise 2.1.2. Find an integral F of f(x) = 5x 4 — Ax which satisfies F(2) = 
12. 

Returning to our original problem, we can now say that if v(t) is, at time 
t, the velocity of an object moving along a straight line and x(t) is the object's 
position at time t, then 

x(t) = I v{t)dt. (2.1.16) 

However, note that (2.1.16) is little more than a restatement of (2.1.1) with new 
notation. 

Example 2.1.8. Suppose the velocity of an object oscillating at the end of a 
spring is 

v(t) = — 20sin(5i) centimeters/second. 

If x(t) is the position of the object at time t, then 

x(t) = — 20sm(5t)dt = 4cos(5t) + c centimeters 

for some constant c. If in addition we know that the object was initially 4 
centimeters from the origin, that is, that x(0) = 4, then we would have 

A = x(0) =4 + c. 

Hence we would have c = 0, and so 

x(t) = 4cos(5i) centimeters 

completely specifies the position of the object at time t. 

Exercise 2.1.3. Suppose the velocity of an object at time t is v(t) = 10sin(i) 
centimeters per second. Find x(t), the position of the object at time t, if x(0) = 
10 centimeters. 

2.1.1 The case of constant acceleration 

Galileo was the first to notice that, ignoring the effects of air resistance, objects 
in free fall near the surface of the earth fall with constant acceleration. Suppose 
that x(t) 7 f(£), and a(t) specify, at time t, the position, velocity, and acceleration 
of an object moving along a straight line, and, moreover, suppose 

a(t) = g (2.1.17) 

for some constant g and all values of t. Since acceleration is the derivative of 
velocity, it follows that 



v(t)= / a(t)dt= / gdt = gt + c (2.1.18) 
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for some constant c. Now if we let vq = v(0), the velocity of the object at time 
t = 0, then 

Vo = v (Q)=c. (2.1.19) 



Hence 



Next we see that 



v(t)=gt + v . (2.1.20) 



x(t) = / v(t)dt = (gt + v )dt = -gt 2 + v t + c (2.1.21) 

for some constant c. If we let Xq = x(0), the position of the object at time t = 0, 
then 

x = x(0) = c. (2.1.22) 

Hence we have 

x{t)= l -gt 2 + v t + x . (2.1.23) 

In the important case of an object in free fall near the surface of the earth, g is 
the constant acceleration due to gravity. When working in units of meters and 
seconds, and taking up as the positive direction, we have g = —9.8 meters per 
second per second, and when working with units of feet and seconds g = — 32 
feet per second per second. Hence, in the former case, (2.1.23) becomes 

x (t) = -AM 2 + v t + x Q (2.1.24) 

and, in the latter case, (2.1.23) becomes 

x(t) = -I6t 2 + v Q t + x . (2.1.25) 

Example 2.1.9. Suppose an object is thrown upward from atop a 10 meter 
tall tower with an initial velocity of 20 meters per second. Then, using (2.1.24), 
the position of the object after t seconds is 

x(t) = -AM 2 + 20t + 10 meters. 

Hence, for example, since the object will reach its maximum height when its 
velocity is 0, we see that the object reaches its maximum height when 

-9.8i+20 = 0, 

that is, when 

20 
t = — « 2.04 seconds. 
9.8 

Thus the object will reach a maximum height of 

ie(2.04) w -4.9(2.04) 2 + 20(2.04) + 10 w 30.41 meters. 

Exercise 2.1.4. For an object in free-fall near the surface of Mars, g = —3.69 
meters per second per second. Find the maximum height reached by an object 
thrown vertically into the air from atop a 10 meter tall tower on Mars with an 
initial velocity of 20 meters per second. 
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2.2 Definite integrals 

We will now consider another approach to solving the problem of finding the 
position function given a velocity function. As above, suppose v(t) specifies, at 
time t, the velocity of an object moving along a straight line, starting at time 
t = a and ending at time t = b. Let x(t) be the position of the object at time t 
and let Xo = x(a) be the initial position of the object. 

Recall that if an object travels at constant velocity r for a time T , then its 
change in position is simply rT (to the right if r > and to the left if r < 0). 
It follows that if v(t) were constant, say v(t) = r for some fixed real number r 
and all t in [a, b], then 

x(t) = x(a) + r(t - a) = x + r(t - a) (2-2.1) 

since the object will have been traveling, after starting at Xq, at a velocity r for 
a time period of length t — a. 

If v(t) is not constant, but doesn't vary by much, then (2.2.1) will give a good 
approximation to x(t). In general, v may change significantly over the interval, 
but, as long as v is a continuous function, we can subdivide [a, b] into small 
intervals for which v(t) does not change by much over any given subinterval. 
That is, if we choose points to, ti, ti, ■ • • , t n such that 

a = t <h <t 2 < ■■■ <t n = b, (2.2.2) 

let Ati = ti — ti-i for i = 1,2,3, ... ,n, and choose real numbers t\, t* 2 , £3, . . . , 
t* n so that £i_i < t* < ti, then 

v(t*)AU (2.2.3) 

will approximate well the change of position of the object from time t = ti_\ to 
time t = ti, provided v is continuous and Ati is small. It follows that we may 
approximate the position of the object at time t = b by adding together all the 
approximate changes in position over the subintervals. That is, 

x(b) w x(a) + v(t$)Ati + v(t* 2 )At 2 + ■■■ + v(t* n )At n . (2.2.4) 

Example 2.2.1. In an earlier example, we had v(t) = — 20sin(5t) centimeters 
per second and x(0) = 4 centimeters. To approximate x(2), we will divide [0, 2] 
into four equal subintervals, each of length 0.5. That is, we will take 

t =0.0,*! = 0.5, £ 2 = 1,<3 = 1.5, £4 = 2, 

and 

Aii = 0.5, At 2 = 0.5, A£ 3 = 0.5, A£ 4 = 0.5. 

Good choices for points to evaluate v(t) are the midpoints of the subintervals. 
In this case, that means we should take 

t\ = 0.25, tl = 0.75,*3 = 1.25, £4 = 1.75. 
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Then we have 

x(2) w ar(0) + u(0.25)Ati + v(0.75)At 2 + v(1.25)At 3 + v(1.75)At 4 
= 4 - 20sin(1.25)(0.5) - 20sin(3.75)(0.5) - 20sin(6.25)(0.5) 

-20sin(8.75)(0.5) 
w -5.6897, 

Note that, from our earlier work, we know that the exact answer is 

x{2) =4cos(10) w -3.3563. 

We may improve upon our approximation by using smaller subintervals. For 
example, if we divide [0,2] into 10 equal subintervals, each of length 0.2, then 
we would have 

t = 0.0, ii = 0.2, t 2 = 0.4, t 3 = 0.6, U = 0.8, t 5 = 1.0, 
t 6 = 1.2,t 7 = 1-4, t 8 = 1-6, t 9 = 1.8, tio = 2.0 

and 

Aii = Ai 2 = ••• = Aii = 0.2. 

If we evaluate v(t) at the midpoints again, then we take 

t\ = 0.1, t* 2 = 0.3, t% = 0.5, t\ = 0.7, t\ = 0.9, 
t\ = 1.1, t? = 1.3, i* = 1.5, t* = 1.7, i* = 1.9. 

Hence we have 

x{2) w x(0) + u(0.1)Aii + w(0.3)Ai 2 + w(0.5)At 3 + h v(1.9)At 10 

= 4 + 0.2(v(0.1) + w(0.3) + w(0.5) + • ■ ■ + i>(1.9)) 

= 4 + 0.2(-20sin(0.5) - 20sin(1.5) - 20sin(2.5) 20sin(9.5)) 

w -3.6720, 

a significant improvement over our first approximation. 

Exercise 2.2.1. Suppose the velocity of an object at time t is v(t) = 10sin(i) 
centimeters per second. Let x(t) be the position of the object at time t. If 
x(0) = 10 centimeters, use the technique of the previous example to approximate 
x(3) using (a) n = 6 and (b) n = 12 subintervals. 

One question which arises immediately is why we would want to find ap- 
proximations to the position function when we already know how to find the 
position function exactly using an integral. There are two answers. First, it is 
not always possible to find an integral for a given function, even when an integral 
does exist. For example, if the velocity function in the previous example were 
v(t) = — 20sin(5t 2 ), then it would not be possible to write an expression for the 



2.2. DEFINITE INTEGRALS 71 

integral of v in terms of the elementary functions of calculus. In this case, the 
best we could do is look for good approximations for the position function x. 

Second, this approach also leads to an exact expression for the position 
function. If we let N be an infinitely large positive integer and divide [a, b] into 
an infinite number of equal subintervals of infinitesimal length 

b — a , 

dt=^ r , (2.2.5) 

then we should expect that 

x{b) ~ x(a) + v(t*)dt + v(t 2 )dt H h v(t* N )dt, (2.2.6) 

where, similar to the work above, t* is a hyperreal number in the ith subinterval. 
Rewriting this as 

a: (6) - x(a) ~ v{t\)dt + v(t*)dt H h v(t* N )dt, (2.2.7) 

we are saying that the change in position of the object from time t = a to time 
t = b is equal to an infinite sum of infinitesimal changes. Although Zeno was 
correct in saying that an infinite sum of zeros is still zero, an infinite sum of 
infinitesimal values need not be infinitesimal. 

The right-hand side of (2.2.7) provides the motivation for the following def- 
inition. 

Definition 2.2.1. Suppose / is a continuous function on a closed bounded 
interval [a, 6]. Given a positive integer N, finite or infinite, we call a set of 
numbers {to, t\,t2, ■ ■ ■ , t n } a partition of [a, b] if 

a = t < ii < t 2 < ■ ■ ■ < t N = b. (2.2.8) 

Given such a partition of [a, b], let t* denote a number with t i _ 1 < t* < ti and 
let Ati = U — ti_i, where i = 1, 2, 3, . . . ,N. We call a sum of the form 

N 

J2 f(tt)Mi = f(tt)Ah + f(t* 2 )At 2 + f(t* 3 )At 3 + ■■■ + f(t* N )At N (2.2.9) 

i=l 

a Riemann sum. If N is infinite and the partition forms subintervals of equal 
length 

dt = At 1 = At 2 = ■■■ = At n , (2.2.10) 

then we call the shadow of the Riemann sum the definite integral of / from a 

to b, which we denote 

,-b 

f{t)dt. (2.2.11) 

J a 

That is, 

J f{t)dt = sh(j2f(t*)dtj. (2.2.12) 
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Note that in order for (2.2.12) to make sense, we need the Riemann sum on 
the right-hand side to be finite (so it has a shadow) and to have the same shadow 
for all choices of infinite integers N and evaluation points t* (so the definite 
integral has a unique value). These are both true for continuous functions 
on closed bounded intervals, but the verifications would take us into subtle 
properties of continuous functions beyond the scope of this text. 

In terms of velocity and position functions, if v(t) is the velocity and x(t) 
the position of an object moving along a straight line from time t = a to time 
t = b, we may now write (combining (2.2.7) with (2.2.12)) 

x(b)-x(a) = / v(t)dt. (2.2.13) 



Since v is the derivative of a;, we might ask if (2.2.13) is true for any differentiable 
function. That is, if / is differentiable on an interval which contains the real 
numbers a and b, is it always the case that 

f(b) - f(a) = f f(t)dt? (2.2.14) 



This is in fact true, but requires more explanation than just the intuitive ap- 
proach of our example with positions and velocities. We will come back to 
this important result, which we call the fundamental theorem of calculus, after 
developing some properties of definite integrals. 

2.3 Properties of definite integrals 

Suppose / is a continuous function on a closed interval [a, 6]. Let N be a positive 
infinite integer, dx = -^, and, for i = 1, 2, • • • , N, let x* a number in the «th 
subinterval of [a, b] when it is partitioned into N intervals of equal length dx. 
We first note that if f{x) = 1 for all x in [a, b], then 

N N 

^2f{x*)dx = J2 dx = b ~ a I 2 - 3 - 1 ) 

1=1 4 = 1 

since the sum of the lengths of the subintervals must be the length of the interval. 
Hence 

rb rb 

f{x)dx= / dx = b-a. (2.3.2) 

J a 

More generally, if f(x) = k for all x in [a, b], where k is a fixed real number, 
then 

N N N 

Y] f(x*)dx = J2 k dx = kJ2 dx = H b - a), (2.3.3) 

2=1 i—1 i — 1 

and so 

b rb 

f{x)dx= kdx = k{b-a). (2.3.4) 
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That is, the definite integral of a constant is the constant times the length of 
the interval. In particular, the integral of 1 over an interval is simply the length 
of the interval. 

If / is an arbitrary continuous function and k is a fixed constant, then 

N N 

Y J kf{x*)dx=kY J f{^*)dXi (2.3.5) 

i=\ i=\ 

and so 

,-b 

kf(x)dx = k f{x)dx. (2.3.6) 

J a 

That is, the definite integral of a constant times / is the constant times the 
definite integral of /. 

If g is also a continuous function on [a, 6], then 

N N N 

^ (/«) + g(x*))dx = J2 f(x*)dx + J29( x *) dx > ( 2 - 3 - 7 ) 

2—1 2—1 i— 1 

and so 

b fb fb 

(f(x)+g(x))dx= / f{x)dx+ / g{x)dx. (2.3.8) 

J a J a 

Now suppose c is another real number with a < c < b. If the closed interval 
[a, c] is divisible into M intervals of length dx, where M is a positive infinite 
integer less than N, then 

N M N 

Y J f«)dx = Y J f«) dx + E K<) dx ( 2 - 3 - 9 ) 

i=l i=\ i=M+l 

implies that 

b t'C t'b 

f(x)dx= / f{x)dx+ / f{x)dx. (2.3.10) 

•J a J c 

This is a reflection of our intuition that, for an object moving along a straight 
line, the change in position from time t = a to time t = b is equal to the change 
in position from time t = a to time t = c plus the change in position from time 
t = c to time t = b. Although we assumed that [a, c] was divisible into an integer 
number of subintervals of length dx, the result holds in general. 

The final properties which we will consider revolve around a basic inequality. 
If / and g are both continuous on [a, b] with f(x) < g(x) for all x in [a, b], then 

JV AT 

Y,f{x*)dx<Y,9{x*)dx, (2.3.11) 

i=l i=l 

from which it follows that 

6 pb 

f(x)dx < / g(x)dx. (2.3.12) 
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For example, if m and M are constants with m < f(x) < M for all x in [a, 6] 
then 

!>b t>b t>b 

mdx < / f(x)dx < / Mdx, 



and so 



m(b-a)< f(x)dx < M(b- a). 



Note, in particular, that if f(x) > for all a; in [a, 6], then 

/(x)<fo > 0. 
Example 2.3.1. From the observation that 

is increasing on (— cxd, 0] and decreasing on [0, oo), it is easy to see that 



(2.3.13) 
(2.3.14) 

(2.3.15) 



1 1 

- < k < 1 

2 ~ 1 + x 2 ~ 



for all x in [—1,1]. Hence 



1 < 



1 + x 2 



7 dx < 2. 



We will eventually see, in Example 2.6.20, that 

1 1 
-dx= - w 1.5708. 

! 1 + X 2 2 

Since for any real number a, — \a\ < a < \a\ (indeed, either a = \a\ or 
a = —\a\), we have 

- \f(x)\ < f(x) < |/(a;)| (2.3.16) 

for all x in [a, b]. Hence 



b rb f'b 

\f(x)\dx< / f(x)dx< / \f(x)\dx, 



or, equivalently, 



f(x)dx 



< / \f(x)\dx. 



(2.3.17) 



(2.3.18) 



Notice that, since the definite integral is just a generalized version of summation, 
this result is a generalization of the triangle inequality: Given any real numbers 
a and b, 

\a + b\<\a\ + \b\. (2.3.19) 

The next theorem summarizes the properties of definite integrals that we 
have discussed above. 
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Theorem 2.3.1. Suppose / and g are continuous functions on [a, b], c is any 
real number with a < c < b, and k is a fixed real number. Then 

f" 

(a) / kdx = k(b — a), 

J a 

(b) / kf{x)dx = k f(x)dx, 

J a J a 

f'b f'b fb 

( c ) / (f(x)+g(x))dx = / f{x)dx+ / g{x)dx, 

fb fC fb 

(d) / f(x)dx= / f(x)dx+ / f(x)dx, 

Ja Ja Jc 

fb fb 

(e) if f(x) < g(x) for all x in [a, b], then / f(x)dx < / g(x)dx, 

J a 'J a 

(f) if m < /(x) < M for all x in [a, b], then m(6 - a) < / /(a;)dx < M(6 - a), 



(g) 



6 

f{x)dx 



■b 

< I \f(x)\dx. 



1 f 2 1 

< / - da; < 1. 



Exercise 2.3.1. Show that 



2 ~ J 1 x 

2.4 The fundamental theorem of integrals 

The main theorem of this section is key to understanding the importance of 
definite integrals. In particular, we will invoke it in developing new applications 
for definite integrals. Moreover, we will use it to verify the fundamental theorem 
of calculus. 

We first need some new notation and terminology. Suppose e is a nonzero 
infinitesimal. Intuitively, e is infinitely smaller than any nonzero real number. 
One way to express this is to note that for any nonzero real number r, 

-~0, (2.4.1) 

r 



that is, the ratio of e to r is an infinitesimal. Now we also have 

e~0, (2.4.2) 



e 2 



e 

that is, the ratio of e 2 to e is an infinitesimal. Intuitively, this means that e 2 
is infinitely smaller than e itself. This is related to a fact about real numbers: 



76 CHAPTER 2. INTEGRALS 

For any real number r with < r < 1, r 2 is smaller than r. For example, if 
r = 0.01, then r 2 = 0.0001. 

Definition 2.4.1. Given a nonzero hyperreal number e, we say another hyper- 
real number <5 is of an order less than e if - is an infinitesimal, in which case we 
write S ~ o(e). 

In other words, we have 

S ~ o(e) if and only if - ~ 0. (2.4.3) 

e 

Example 2.4.1. If a is any infinitesimal, than a ~ o(l) since y = a is an 
infinitesimal. 

Example 2.4.2. If e is any nonzero infinitesimal, then e 2 ~ o(e) since 

e 

Now suppose AT is a positive infinite integer, e = -h, and Si ~ o(e) for 
i = 1, 2, . . . , N. Then, for any positive real number r, 



e 

and so 

v 14 



< r, (2.4.4) 



e 
i=i 

Multiplying both sides by e, we have 

N 

y IA.-I <c rNf = rN 

N 



Y J —<rN. (2.4.5) 



" 1 

5^|(5i| < WVe= riV— = r. (2.4.6) 



-JV 



Since this holds for all positive real numbers r, it follows that $Z i=1 |<J»| is an 
infinitesimal. Now 



JY 



E* 



AT 

<X>i, ( 2 - 4 - 7 ) 

8=1 



and so we may conclude that J^iLi ^» ^ s an infinitesimal. In words, the sum of 
N infinitesimals of order less than -^ is still an infinitesimal. 

Now suppose B is a function that for any real numbers a < b in an open 
interval / assigns a value B(a,b). Moreover, suppose B has the following two 
properties: 

• for any a < c < b in I, B(a, b) = B(a, c) + B(c, b), and 
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• for some continuous function h and any nonzero infinitesimal dx, 

B(x,x + dx) - h(x)dx ~ o(dx) (2.4.8) 

for any x in /. 
For a positive infinite integer TV, let dx = -^ and let 

a = xq < x\ < X2 < ■ ■ ■ < xn = b (2.4.9) 

be a partition of [a, b] into N equal intervals of length dx. Then 
B(a, b) = B(x , xi) + B(xi,x 2 ) + B(x 2 ,x 3 ) + h B(xn-i,xn) 

N 

= ^B(xi_i,a;i_i + dx) 

i=\ 

N 

= \ ((B(xi-i,Xi-i + dx) — h(xi-i)dx) + h(xi-\)dx) 

N N 

= 2_, {B(xi-i,Xi-i + dx) — h(xi-i)dx) + 2_, h(xi-i)dx 

i=\ i=\ 

~ y^ (B(x i -i,x i - 1 + dx) - h(xj-i)dx) + / h(x)dx. (2.4.10) 

Since the final sum on the right is the sum of N infinitesimals of order less than 

-jy, it follows that 

B(a,b) = / h(x)dx. (2.4.11) 

<J a 

This result is basic to understanding both the computation of definite inte- 
grals and their applications. We call it the fundamental theorem of integrals. 

Theorem 2.4.1. Suppose B is a function that for any real numbers a < b in 
an open interval / assigns a value B(a, b) and satisfies 

• for any a < c < b in I, B(a, b) = B(a, c) + B(c, b), and 

• for some continuous function h and any nonzero infinitesimal dx, 

B(x,x + dx) - h(x)dx ~ o(dx) (2.4.12) 

for any x m I. 

Then 

B(a,b) = / h{x)dx (2.4.13) 



for any real numbers a and b in /. 



f{x), (2.4.16) 

(2.4.17) 
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We will look at several applications of definite integrals in the next section. 
For now, we note how this theorem provides a method for evaluating integrals. 
Namely, given a function / which is differentiable on an open interval /, define, 
for every a < b in /, 

B(a,b) = f(b) - f(a). (2.4.14) 

Then, for any a, 6, and c in I with a < c < b, 

B(a,b) = f{b) - /(a) 

= (/(&)- /(c)) + (/(c) -/(a)) 

= B{a,c) + B{c,b). (2.4.15) 

Moreover, for any infinitesimal dx and any x in /, 

-B(a;, a; + cfcc) /(x + dx) — f[x) 
dx dx 

from which it follows that 

B(x, x + dx) — f'(x)dx 
dx 
is an infinitesimal. Hence 

B(x,dx)-f'(x)dx~o(dx), (2.4.18) 

and so it follows from Theorem 2.4.1 that 

f(b) - f(a) = B{a, b)= f f'(x)dx. (2.4.19) 

J a 

This is the fundamental theorem of calculus. 

Theorem 2.4.2. If / is differentiable on an open interval J, then for every 
a < b in I, 

rb 

f'(x)dx = f(b) - f(a). (2.4.20) 

Example 2.4.3. To evaluate 

fi 
xdx, 
'o 

we first note that g(x) = x is the derivative of f(x) = -^x 2 . Hence, by Theorem 
2.4.2, 

xdx = f(l)-f(Q)= l --Q= l - 

We will write 

f(x)\ b a = f(b)-f(a) (2.4.21) 

to simplify the notation for evaluating an integral using Theorem 2.4.2. With 
this notation, the previous example becomes 

xdx = -x z = = -. 

o 2 2 
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Example 2.4.4. Since 



2 t 3 

x dx = —x + c, 
3 



we have 



i: 



2j i , 

x ax = -x" 
3 



8 1 _ 7 
3 ~ 3 ~ 3' 



Example 2.4.5. Since 

-20sm(5x)dx = 4cos(5a;) + c, 
we have 

2tt 

-20sm(5t)dt = 4cos(5i)|o 7r = 4-4 = 0. 

o 

Note that if we consider an object moving along a straight line with velocity 
v(t) = — 20sin(5£), then this definite integral computes the change in position 
of the object from time t = to time t = 2tt. In this case, the object, although 
always in motion, is in the same position at time t = 2ir as it was at time t = 2ir. 

l 
Exercise 2.4.1. Evaluate / x A dx. 



Exercise 2.4.2. Evaluate 



o 



r 

l sn\(x)da 
Jo 



Exercise 2.4.3. Suppose the velocity of an object moving along a straight line 
is v(t) = 10sin(i) centimeters per second. Find the change in position of the 
object from time t = to time t = tt. 



2.5 Applications of definite integrals 

In this section we will look at several examples of applications for definite inte- 
grals. 

2.5.1 Area between curves 

Consider two continuous functions / and g on an open interval / with fix) < 
g(x) for all x in /. For any a < b in /, let R(a, b) be the region in the plane 
consisting of the points (x,y) for which a < x < b and f(x) < y < g(x). That 
is, R(a, b) is bounded above by the curve y = g(x), below by the curve y = /(x), 
on the left by the vertical line x = a, and on the right by the vertical line x = b, 
as in Figure 2.5.1. Let 

A(a, b) = axea of R(a, b). (2.5.1) 



;so 
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Figure 2.5.1: Region R(a, b) between the graphs of y = g(x) and y = f(x) 



Clearly, for any a < c < 



A(a,b) = A(a,c) + A(c,l 



(2.5.2) 



Now for an x in / and a positive infinitesimal dx, let c be the point at which 
g(u) — f(u) attains its minimum value for x < u < x + dx and let d be the point 
at which g{u) — f(u) attains its maximum value for x < u < x + dx. Then 

(g(c) - f{c))dx < A{x, x + dx) < (g(d) - f(d))dx. (2.5.3) 

Moreover, since 

g(c) - /(c) < g(x) - f{x) < g(d) - /(d), (2.5.4) 

we also have 

(g(c) - f(c))dx < (g(x) - f(x))dx < (g(d) - f(d))dx. (2.5.5) 

Putting (2.5.3) and (2.5.5) together, we have 

\A(x, dx) - (g(x) - f(x))dx\ < ((g(d) - /(d)) - (/(c) - g(c)))dx (2.5.6) 



or 



\A(x,dx)- {g{x) - f{x))dx\ 
dx 



<(g(d)- /(d)) -(/(c) - 5 (c)) (2.5.7) 



Now since c ~ x and d ~ x, 

(g(d) - /(d)) - (g(c) - /(c)) = (g(d) - g(c)) + (/(c) - /(d)) ~ 0. (2.5. 
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Figure 2.5.2: The region bounded by the curves y = x + 2 and y = x 2 



Hence 

A(x, dx) — (g(x) — f(x))dx ~ o(dx). 

It now follows from Theorem 2.4.1 that 



(2.5.9) 



A(a,b)= / (g(s) - f(x))dx. 



(2.5.10) 



Example 2.5.1. Let A be the area of the region R bounded by the curves 
with equations y = x 2 and y = x + 2. Note that these curves intersect when 
x 2 = x + 2, that is when 

= a; 2 -a;-2 = (a; + l)(a; - 2). 

Hence they intersect at the points (— 1, 1) and (2,4), and so R is the region in 
the plane bounded above by the curve y = x + 2, below by the curve y = x 2 , on 
the right by x = — 1, and on the left by x = 2. See Figure 2.5.2. Thus we have 



(x + 2- x 2 )dx 



1 9 1 * 

x 2 + 2x x A 



2 

2 + 4 



1 1 

- -2+ - 

2 3 
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y = /M 



R 



x = b 



Figure 2.5.3: Area beneath the graph of a function / 



Exercise 2.5.1. Find the area of the region bounded by the curves y = x and 

2 

y = % ■ 

Exercise 2.5.2. Find the area of the region bounded by the curves y = x 2 
and y = 2 — x 2 . 

Exercise 2.5.3. Find the area of the region bounded by the curves y = yfx 
and y = x. 

Now consider a continuous function / on an interval [a, b] with f(x) > for 
all x in [a, b]. If A is the area of the region R bounded above by the graph of 
y = f(x), below by the graph of y = (that is, the x-axis), on the right by the 
vertical line x = a, and on the left by the graph of x = b (see Figure 2.5.3), then 



A 



b rb 

(f(x)-0)dx= / f(x)dx. 



(2.5.11) 



This gives us a geometric interpretation for a the definite integral of a nonneg- 
ative function / over an interval [a, b] as the area beneath the graph of / and 
above the x-axis. 

Example 2.5.2. If A is the area of the region R beneath the graph of y = sin(x) 
over the interval [0,7r], as in Figure 2.5.4, then 



sm(x)dx = — cos(a;)| = 1 + 1 = 2. 



On the other hand, if / is continuous on [a, b] with f{x) < for all x in 
[a, b], and A is the area of the region R above the graph of y = f{x) and below 
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Kl 



1.5 -r 



y = sin(:r) 




0.5 — 



Figure 2.5.4: Area beneath the graph of y = sin(x) 



the x axis, then 



(0-f(x))dx 



f(x)dx. 



(2.5.12) 



That is, the definite integral of a non-positive function / over an interval [a, b] 
is the negative of the area above the graph of / and beneath the x-axis. 

In general, given a continuous function / on an interval let R be the region 
bounded by the x-axis and the graph of y = f(x). If A + is the area of the part 
of R which lies above the x-axis and A~ is the area of the part of R which lies 
below the x-axis, then 

f(x)dx = A + -A~. (2.5.13) 



Example 2.5.3. Note that 



2,T 



sin(x)dx = — cos(x)|( 



-1 + 1 = 0. 



Geometrically, we can see this result in Figure 2.5.5. If R + is the the region 
beneath the graph of y = sin(x) over the interval [0, 7r] and R~ is the region 
above the graph of y = sin(x) over the interval [0,27r], then these two regions 
have the same area. Hence the integral, which is the area of R + minus the area 
of R~, is 0. 



Exercise 2.5.4. Evaluate 



x dx 



and explain the result geometrically. 



«4 
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-1.5 



Figure 2.5.5: Area of R + is the same as the area of R 



Exercise 2.5.5. Evaluate 



xdx 



and explain the result geometrically. 
Exercise 2.5.6. Explain, geometrically, why 



V 1 — x 2 dx = —. 



2.5.2 Volumes 

Consider a three-dimensional body B. Given a line, which we will call the z-axis, 
let V(a, b) be the volume of B which lies between planes which are perpendicular 
to the 2-axis and pass through z = a and z = b. Clearly, for any a < c < b, 



V(a,b) = V(a,c) + V(c,b). 



(2.5.14) 



Now suppose that, for any a < x < b, R(z) is a cross section of B perpendicular 
to the 2-axis. See Figure 2.5.6. Let A(z) be the area of R(z). We assume A is 
a continuous function of z. For a positive infinitesimal dz, let A have, on the 
interval [z, z + dz], a minimum value at c and a maximum value at d. Then 



A(c)dz < V(z, z + dz) < A(d)dz. 
Since we also have A(c) < A(z) < A(d), it follows that 

\V(z, z + dz)- A(z)dz\ < (A(d) - A{c))dz. 
Thus 



\V(z,z + dz) - A(z)dz\ 
dz 



< A(d) - A(c) 



(2.5.15) 

(2.5.16) 
(2.5.17) 
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Figure 2.5.6: Cross sections of a body perpendicular to the z-axis 



Since A is continuous and d ~ z and c ~ z, A(d) — A(c) is infinitesimal. Hence 

V{z,z + dz)- A{z)dz ~ o(dz), (2.5.18) 

from which it follows, by Theorem 2.4.1, that 



V(a,b) = [ A{z)dz 

J a 



(2.5.19) 



Example 2.5.4. The unit sphere S, with center at the origin, is the set of all 



points (x, y, z) satisfying x + y 



1 (see Figure 2.5.7). For a fixed value 



of z between —1 and 1, the cross section R(z) of S perpendicular to the z-axis 
is the set of points (x, y) satisfying the equation x 2 + y 2 = 1 — z 2 . That is, R(z) 
is a circle with radius yl — z 2 . Hence R(z) has area 

A(z) =tt(1-z 2 ). 

If V is the volume of 5, it now follows that 

»i 



V 



7r(l — z )dx 



7T Z Z 

3 



4tt 
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x 2 + y 2 + z 2 = 1 




Figure 2.5.7: The unit sphere x 2 + y 2 + z 2 = 1 



Exercise 2.5.7. For r > 0, the equation of a sphere S of radius r is x 2 



Show that the volume of S is 1 7rr 3 . 



Exercise 2.5.8. Let P be a pyramid with a square base having corners at 
(1, 1, 0), (1, —1, 0), (—1, —1, 0), and (—1, 1, 0) in the xy-plane and top vertex at 
(0, 0, 1) on the z-axis. Show that the volume of P is o. 

Example 2.5.5. Let T be the region bounded by the z-axis and the graph 
of z = x 2 for < x < 1. Let B be the three-dimensional body created by 
rotating T about the z-axis. See Figure 2.5.8. If R(z) is a cross section of B 
perpendicular to the z-axis, then R(z) is a circle with radius \fz. Thus, if A(z) 
is the area of R(z), we have 

A(z) = irz. 

If V is the volume of B, then 



V 



TTZtlz 



7T 

2' 



Exercise 2.5.9. Let T be the region bounded by z-axis and the graph of z = x 
for < x < 2. Find the volume of the solid B obtained by rotating T about the 
z-axis. 

Exercise 2.5.10. Let T be the region bounded by z-axis and the graph of 
z = x for < x < 1. Find the volume of the solid B obtained by rotating T 
about the z-axis. 

Example 2.5.6. Let T be the region bounded by the graphs of z = x and 
x = x 2 for < x < 1. Let B be the three-dimensional body created by rotating 
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Figure 2.5.8: Region T rotated about z-axis to create solid body B 

T about the z-axis. See Figure 2.5.9. If R{z) is a cross section of B perpendicular 
to the z-axis, then R(z) is the region between the circles with radii z« and *Jz, 
an annulus. Hence if A(z) is the area of R(z), then 

A(z) = 7T I Z * ) — 7T (\/z) = T^{\/~Z — z) . 

If V is the volume of B 7 then 



V 



tt(^/z — z)dz = ir —z 2 



3 2 



Tt 

6' 



Exercise 2.5.11. Let T be the region bounded by the curves z = x and z 
x 2 . Find the volume of the solid B obtained by rotating T about the z-axis. 




Figure 2.5.9: Region T rotated about z-axis to create solid body B 
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Oi,/(£i)) 



(x ,f(xo)) 




(x,.f(x 3 )) 



(X4.,f(X4.)) 
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a — Xq X\ 



X2 



%3 



Xi b = i, 



Figure 2.5.10: Approximating the length of a curve with N = 5 subintervals 



2.5.3 Arc length 

Consider a function / which is continuous on the closed interval [a, b]. Let C be 
the graph of / over [a, b] and let L be the length of C . To approximate L, we 
first divide [a, b] into N equal subintervals, each of length 



Ax 



b — a 



(2.5.20) 



and let xq = a, x\, X2, ■ ■ ■ , Xn = b be the endpoints of the subintervals. If Li is 
the length of the line from (x,_i, /(a;,_i)) to (xi, f{xi)), for i = 1,2, ... ,N (see 
Figure 2.5.10), then 



L«Li+L 2 



jv 
Lat = 2_^ Li. 

4 = 1 



Since 



^ = Vte - *i-i) 2 + (/(a*) - /(n-i) 2 = V(A^) 2 + (Ay,) 2 



where 



we have 



Ay, = /(x») - f(xi-i) = /(x,--i + Arc) - /(a;»_i), 



A? 



JV 



L«^V(A I ) 2 + (A^ = ^Jl 



(=i 



i=l 



Ay,- 

Ax 



Ax. 



(2.5.21) 

(2.5.22) 
(2.5.23) 

(2.5.24) 
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0.5 — 



0.5 



Figure 2.5.11: Graph of y = x 2 over [0, 1] 



Now we should expect the approximation in (2.5.24) to become exact when 
N is infinite. That is, for N a positive infinite integer, let 



dx 



b — a 



and 

If the shadow of 



dy= f(x + dx) - f(x). 



N 



El' 



dx. 



(2.5.25) 
(2.5.26) 

(2.5.27) 



is the same for any choice of N, then we call (2.5.27) the arc length of C. Now 
if / is differentiable on an open interval containing [a, b], and /' is continuous 
on [a, b], then (2.5.27) becomes the definite integral of y/l + (f'(x)) 2 . That is, 
the arc length of C is given by 



L 



s/1 + (/'(x)) 2 dx. 



(2.5.28) 



Example 2.5.7. Let C be the graph of f{x) =i 2 over the interval [0, 1] (see 



Figure 2.5.11) and let L be the length of C. Since f'{x) = \\fx, we have 

.1 



Now 



9 
I + —x dx. 



r~ j 2 3 

'x ax = —x 2 + c, 

o 
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so we might expect an integral of \/l + § to be 



However, 



2 / 9 
- [1 + -x 

3 V 4 



d 2 / 9 

1 + -a; 

dx3 V 4 



9/9 



and so, dividing our original guess by |, we have 



, 9 j 8 (■, 9 

1 H — a; da; = — 1 H — x 

4 27 V 4 



which may be verified by differentiation. Hence 



/ 

Jo 


V x + i x 


r/.? 

3 


8 
27 


(-H 


2 


8 


/l3\/l3 


-1 


27 


\ « 


13VT3-8 






27 




1.4397. 





c, 



Example 2.5.8. Let C be the graph of f(x) = x 2 over the interval [0, 1] (see 
Figure 2.5.12) and let L be the length of C . Since /'(x) = 2a;, 



Vl + 4x 2 dx. 



However, we do not have the tools at this time to evaluate this definite integral 
exactly. Still, we may use (2.5.24) to find an approximation for L. For example, 
if we take N = 100 in (2.5.24), then Ax = 0.01 and 

A Vi = /(0.01*)-/(0.01(*-l)) 
= (O.Oli) 2 - (0.01(i- l)) 2 
= 0.0001(« 2 - (i 2 -2i+l)) 
= 0.0001(2^-1) (2.5.29) 

, N, and so 

100 

L » J2 V(Ax) 2 + (A^) 2 w 1-4789. 



for i = 1,2,. 



i=i 
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0.5 — 




Figure 2.5.12: Graph of y = x 2 over [0, 1] 



We will return to this example in Examples 2.6.19 and 2.7.9 to find an exact 
expression for L. 



Exercise 2.5.12. Let C be the graph of y = Is 2 over the interval [1,3]. Find 
the length of C. 

Exercise 2.5.13. Let C be the graph of y = sin(x) over the interval [0,7r]. 
Use (2.5.24) with TV = 10 to approximate the length of C. 



2.6 Some techniques for evaluating integrals 

2.6.1 Change of variable 

If F is an integral of / and tp is a differentiable function, then, using the chain 
rule, 

^F(<p(x)) = F'(cp(x))<p'(x) = fMxW(x). (2.6.1) 

Written in terms of integrals, we have 

f(<p(x))<p'(x)dx = F(tp(x)) + c. (2.6.2) 



If we let u = <p(x) and note that 



f(u)du = F(u) + c, 



(2.6.3) 
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we may express (2.6.2) as 

/(.;(>))/(,-'),/,-= / /(„),/,/. (2.(i.4) 

That is, we may evaluate 



f(<p(x))tp'(x)dx, 

by changing the variable to u = f[x), with ip'(x)dx becoming du since 



du 

— = y{x). 

dx 



Example 2.6.1. To evaluate 



lei 



2x\J 1 + x 2 dx : 



u = l + x 2 
du = 2xdx. 



Then 



2z"\/l + x 2 dx = \ y/u du = -u 2 + c = -(1 + x 2 ) 2 + c. 



Example 2.6.2. To evaluate 



lei 



xsin(:c )dx, 



u = x 
du = 2xdx 

Note that in this case we cannot make a direct substitution of u and du since 
du = 2xdx does not appear as part of the integral. However, du differs from 
xdx by only a constant factor, and we may rewrite du = 2xdx as 

-du = xdx. 

2 

Now we may perform the change of variable: 

If 1 1 

X sin(u)dx = — / sm(u)du = — - cos(w) + c = — - cos(a; 2 ) + c. 

Z J z z 
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Example 2.6.3. Note that we could evaluate the integral 

/ cos(4x)dx 
using the substitution 



u = Ax 
du = Adx, 



which gives us 



1 /" 1 1 

cos(4x)dx = — / cos(u)du = — sin(u) + c = — sin(4x) + c. 

However, it is probably quicker, and easier, to guess that the integral of cos(4:r) 
should be close to sin(4a;), and then correcting this guess appropriately after 
noting that 

— sin(4x) = 4cos(4a;). 
dx 

Example 2.6.4. To evaluate 



/' 



cos (5x) sin(5x)dx, 

let 

u = cos(5a;) 
du = —5sm(5x)dx. 

Then 

If 1 1 
cos (5a;) s'm(5x)dx = — u du = u + c = cos (5a;) + c. 

O J lo 10 

Now consider the definite integral 

,-b 
f((p(x))ip'(x)dx. 

If F is an integral of /, c = ip(a), and d = ip(b), then 



/ 



b 

f(Lp(xW(x)dx=F(<p(x))\ b a 



F(<p(b)) - F(<p(a)) 

F(d) - F(c) 

F(u)\ d c 

d 

f{u)du. (2.6.5) 

That is, we may use a change of variable to evaluate a definite integral in the 
same manner as above, the only difference being that we must change the limits 
of integration to reflect the values of the new variable u. 
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Example 2.6.5. To evaluate 

1 T. 

dx, 



V / T+^ 



let 







u = 1 + x 2 








du = 2xdx. 




Note that when x 


= 0, u = 1, and when x = 1, u = 2. 


Hence 




f 

Jo 


— dx = — 1 — f= <iu = \/u| , 


= V2 


Example 


2.6.6. 


To evaluate 

/ cos (2x) sin(2x)dx, 

^0 




let 




w = cos(2x) 
du = — 2 sin(2x)dx. 




Then u = 


1 when 


x = and w = when x = ? , so 








/ cos (2x) sin(2x)rfa; = / ii 


2 rfw. 



Note that, after making the change of variable, the upper limit of integration is 
less than the lower limit of integration, a situation not covered by our definition 
of the definite integral or our statement of the fundamental theorem of calculus. 
However, the result on substitutions above shows that we will obtain the correct 
result if we apply the fundamental theorem as usual. Moreover, this points 
toward an extension of our definition: if b < a, then we should have 

b fa 

f{x)dx = - f{x)dx, (2.6.6) 

Jb 

which is consistent with both the fundamental theorem of calculus and with the 
definition of the definite integral (since, if b < a, dx = -^ < for any positive 
infinite integer N). With this, we may finish the evaluation: 



1 r I , 

1 ' .2 7,. _ 1 / „ 2 



/ cos (2x) sm(2x)dx = — / u 2 du= - I u z du 
Jo 2 A 2j 6 



1 = 1 
o 6- 



2.6. SOME TECHNIQUES FOR EVALUATING INTEGRALS 95 

Exercise 2.6.1. Evaluate / 3x 2 \/l + x 3 dx. 

Exercise 2.6.2. Evaluate / aV^T^ dx . 

Exercise 2.6.3. Evaluate / sec 2 (3x) ta,n 2 (3x)dx. 

Exercise 2.6.4. Evaluate / — ^=^= dx. 

Jo \/4+a; 2 

7T 

fe 
Exercise 2.6.5. Evaluate / sm(3x)dx. 

Jo 

Exercise 2.6.6. Evaluate / sin 4 (2:c) cos(2x)dx. 

Jo 

2.6.2 Integration by parts 

Suppose u and v are both differentiable functions of x. Since, by the product 
rule, 

d dv du 

—uv = u- — \-v—, (2.6.7) 

dx dx dx 

we have 

dv d du 

u — = — uv — v — . (2.6.8) 

dx dx dx 

Hence, integrating both sides with respect to x, 

dv f d f du f du 

u — dx = / — uv — / v — = uv — / v — dx, (2.6.9) 

dx J dx J dx J dx 

which we may write as 

/ udv = uv — vdu. (2.6.10) 

This last formulation, known as integration by parts, is useful whenever the 
integral on the right of (2.6.10) is in someway simpler than the integral on the 
left. The next examples will illustrate some typical cases. 

Example 2.6.7. Consider the integral 

xcos(x)dx. 



If we let u = x and dv = cos(x)dx, then du = dx and we may let v = sin(x). 
Note that we have some choice for v since the only requirement is that it is an 
integral of cos(a;). Using (2.6.10), we have 

x sin(x)dx = uv — vdu = x sin(a;) — / sin(x)dx = x sin(a;) + cos(x) + c. 
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In evaluating a definite integral using integration by parts, we must remem- 
ber to evaluate each piece of the integral. That is, 

,-b 
udv = uv\ a — / vdu. (2.6.11) 



Example 2.6.8. To evaluate 



let 



X s'm(x)dx, 
o 



u = x dv = s'm(x)dx 

du = 2xdx v = — cos (a;). 

Then, using (2.6.11), 

x s'm(x)dx = —x cos(x) L + / 2xcos(x)dx = tt + / 2xcos(x)dx. 





Note that the final integral is simpler than the integral with which we started, 
but still requires another integration by parts to finish the evaluation. Namely, 
if we now let 

u = 2x dv = cos(x) 

du = 2dx v = sin(x), 

we have 



x sm(x)dx = tt + 2a;sin(x)|Q — / 2sm(x)dx 



2, 

X 

» 

= tt 2 + (0-0)+ 2cos(a;)|o 



= n 2 


-2- 


-2 




= n 2 


-4. 






Example 2.6.9. To evaluate 

f 1 


Wi 








+ x 


dx, 


Jo 








let 








u = X 


dv = y/l + x dx 


du = dx 






2 /„ \ 3 

V= -(1 + X)5. 



2.6. SOME TECHNIQUES FOR EVALUATING INTEGRALS 



97 



Then 



x\/l + x dx = —x{\ + x) 2 
u 3 



2 
3' 

4\/2 



i 2 ,i 



(1 + x) 2 dx 



t(1 + x)' 



3 15 

4\/2 16\/2-4 
~~3 15 

4\/2 + 4 
15 ' 



Exercise 2.6.7. Evaluate / xsin(2a;)fia; 



Exercise 2.6.8. Evaluate /s»coe(3*)«fc. 



Exercise 2.6.9. Evaluate / a; cos I -x ) dx 



o 



Exercise 2.6.10. Evaluate / 3x cos(x)dx. 

Jo 

Exercise 2.6.11. Evaluate / x 2 \J\ + x dx. 

Jo 

2.6.3 Some integrals involving trigonometric functions 

The next examples will illustrate how various identities are useful in simplifying 
some integrals involving trigonometric functions. 

Example 2.6.10. To evaluate the integral 



sin (x)dx, 



we will use the half-angle formula: 

sin 2 (2) 
Then 



., 1 — cos(2x) 



(2.6.12) 



/ sm 2 (x)dx = — I 
Jo 2 J 



sin (x)dx = - / (1 — cos(2x))dx 
2 Jo 



2 

7T 
2' 



■ sin(2a;) 
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There is also a half- angle formula for cosine, namely, 



cos 2 (x) 



1 + cos(2x) 



(2.6.13) 



As illustrated in the next example, we may use the half-angle formulas recur- 
sively to evaluate the integral of any even power of sine or cosine. 

Example 2.6.11. Using 2.6.13 twice, we have 



/ cos (3x)dx = / (cos 2 (3x)) dx 
Jo Jo 



* /l 



-(1 + cos(6x)) I dx 



1 

4 J, 
1 



(1 + 2 cos (6a;) + cos (6x))dx 



* o 

IT 1 

4 + 8 : 
3tt 



12 



sin(6a;) 



+ - / (1 +cos(12a;))cfe 
o 8 -'o 



!)(, 



sin(12:r) 



Exercise 2.6.12. Evaluate / sin 2 (2x)dx. 

Jo 

Exercise 2.6.13. Evaluate / cos (3x)dx. 

Jo 

Exercise 2.6.14. Evaluate J cas\ X )*c. 

The next example illustrates a reduction formula. 
Example 2.6.12. Suppose n > 2 is an integer and we wish to evaluate 

sm n (x)dx. 



dv = sin(x)dx 
v = — cos(x), 



We begin with an integration by parts: if we let 
u = sin"~ (x) 
du = (n — 1) sin ra_ (x) cos(x)dx 
then 



sin n (x)dx = —sin™ (x)cos(cc) „ + {n — 1) / sin™ (x) cos (x)dx 

(i Jo 



(n — 1) / sin™ (x) cos 2 (x)dx. 
Jo 
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Now cos 2 (a;) = 1 — sin (x), so we have 

/ s'm n (x)dx = (n - 1) / sm n ~ 2 (x)(l - sin 2 (x))dx 
Jo Jo 

= (n - 1) / sin"" 2 (2:)da; - (n - 1) / sin n (x)dx. 
Jo Jo 

Notice that L sm n (x)dx occurs on both sides of this equation. Hence we may 
solve for this quantity, first obtaining 

fir pir 

n I sm n (x)dx = (n - 1) / sin"~ 2 (a;)da;, 



f 

■Jo 



and then 

n - 1 r 

s\n n (x)dx = / sin n_2 (x)rfa;. (2.6.14) 

io n Jo 

Note that, although we have not yet found the value of our integral, we have 
reduced the power of sin(x) in the integral. We may now use (2.6.14) repeatedly 
to reduce the power of sin(x) until we can easily evaluate the resulting integral. 
For example, if n = 6 we have 

5 I' 7 * 
sin (x)dx = — / sin (x)dx 
o 



6 Jo 

$in 2 (x)dx 



53 , . 2 



6 4 J 

Hi.' dx 

16' 



i r 

2Jo ' 



Similarly, 



r* 4 f* 

/ sin (x)dx = - sin (x)dx 
Jo 5 Jo 

42 r 

= — / sm(x)dx 
5 3 J 



— cos(a;) 



16 
15' 



Exercise 2.6.15. Use the reduction formula (2.6.14) to evaluate 

sin (x)dx. 



100 CHAPTER 2. INTEGRALS 

Exercise 2.6.16. Use the reduction formula (2.6.14) to evaluate 

sin (x)dx. 



Exercise 2.6.17. Derive the reduction formula 

cos"(:r) = / cos n ~ 2 (x)dx, 

o n J 

where n > 2 is an integer. 

Exercise 2.6.18. Use the reduction formula of the previous exercise to eval- 
uate 

/*7T 

cos 6 (x)dx. 
o 



Exercise 2.6.19. Derive the reduction formulas 

/.„,,. 1 . „_ X/ , , , n-1 f . n _ 2 . ,, 

/ sin (x)dx = sin (x)cos(xjH / sin (x)dx 

J n n J 



and 



cos n (x)dx = — cos™ 1 (x)sm(x)-\ /cos™ 2 {x)dx, 

n n 



where n > 2 is an integer. 

Example 2.6.13. An alternative to using a reduction formula in the last ex- 
ample begins with noting that 

I sin (x)dx = / sin (x) sin (x)dx 
Jo Jo 

r 

(sin (x)) sm(x)dx 

u 

r 

(1 — cos (x)) sin(x)dx 

ii 

(1 — 2 cos (x) + cos (x)) sm(x)dx. 
/o 

The latter integral may now be evaluated using the change of variable 



u = cos(x) 
du = — sin(x)<ix, 
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giving us 

-l 



sin b (x)dx = - (1 - Ivr + vr)du 

u Ji 

l 

(l-2u 2 + u A )du 
■ l 

l 



2 , 1 

u u H — u 

3 5 



-l 



2 1\ / 2 1 

X -3 + 5 J" - 1+ 3-5 



16 

15' 



as we saw above. 



Exercise 2.6.20. Evaluate / cos 5 (2x)dx. 

Jo 



Our next example relies on a trigonometric identity for sin(ax) cos(6a;), where 
a and b are both real numbers. We first note that, using the angle addition and 
subtraction formulas for sine, 

sin((a + b)x) = sin(a:r) cos(&;c) + sin(6x) cos(aj;) (2.6.15) 

sin((a — b)x) = sm(ax) cos(bx) — sin(6a;) cos(bx). (2.6.16) 

Adding these together, we have 

2 sin(ax) cos(6a;) = sin((a + b)x) + sin((a — b)x), (2.6.17) 

and so 

sin(ax) cos(6a;) = -(sin((a + b)x) + sin((a — b)x)). (2.6.18) 

Example 2.6.14. To evaluate 

r 

I sin(2x) cos(?>x)dx, 
Jo 

we first note that, using (2.6.18) with a = 2 and 6 = 3, 

sin(2a;) cos(3x) = -(sin(5x) +sin(— x)) = -(sin(5x) — sin(a;)). 
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Hence 



sin(2a;) cos(3x)dx 



sm(5x)dx 



sin(x)dx 



1 . , 

— cos(5a;) 
10 v ; 


71 1 

+ t; cos ( x ) 
o z 


1 1\ / 1 l\ 
10 + 10/ + v 2 V 


4 


5' 





For integrals involving sin(aa;) sin(6x), we begin with the the angle addition 
and subtraction formulas for cosine, 

cos((a + b)x) = cos{ax) cos(bx) — sin(6:r) sin(aa;) (2.6.19) 

cos((a — b)x) = cos(ax) cos(bx) + sin(fa:) sin(6a;). (2.6.20) 

Subtracting the first of these from the second, we have 

2sin(aa;) s'm(bx) = cos((a — b)x) — cos((a + b)x), (2.6.21) 

and so 

sin(aa;) sin(6a;) = — (cos((a — b)x) — cos((a + b)x)). (2.6.22) 

Example 2.6.15. To evaluate 



sin(3x) sm(5x)dx, 



we first note that, using (2.6.22) with a = 3 and 6=5, 

sin(3a;) sin(5x) = — (cos(— 2x) — cos(8x)) = -(cos(2a;) — cos(8a;)). 

Note that we would have the same identity if we had chosen a = 5 and 6 = 3. 
Then 



sin(3a;) sin(5a;)da; 



1 

2 
1 

4 
0. 



cos(2x)dx 



1 



cos(8x)dx 



sin(2x) 



16 



For integrals involving cos(aa:) cos(6a;), we add (2.6.19) to (2.6.20) to obtain 
2cos(ax) cos(6x) = cos((a + b)x) + cos((a — b)x), (2.6.23) 

which leads to 

(2.6.24) 



cos(aa;) cos(bx) = -(cos((a + b)x) + cos((a — b)x)). 
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Example 2.6.16. To evaluate 

cos(3:c) cos(5x)dx, 



we note that, using (2.6.24) with a = 3 and 6 = 5, 

cos(3a;) cos(5a;) = -(cos(8x) +cos(— 2x)) = -(cos(8x) + cos(2ir)). 



Hence 



1 [* 1 f* 

cos(3a;) cos(5x)dx = - / cos(8x)dx -\ — / cos(2x)dx 
ii 2 J 2 J 



1 . , 
— sin(8x) 
16 v ; 

0. 



1 . . 
- sin(2a;) 
4 



Exercise 2.6.21. Evaluate / sin(x) cos(2x)dx. 

Jo 

Exercise 2.6.22. Evaluate / sin(a;) sin(2x)dx. 

Jo 



Exercise 2.6.23. Evaluate 



/ sin(3:r) cos(3x)da 
Jo 



'o 
Note: This may be evaluated with a substitution. 

Exercise 2.6.24. Evaluate / cos(x) cos(2x)dx. 

Jo 

Exercise 2.6.25. For any positive integers m and n, show that 

/ sin(ma;) cos(nx)dx = 0, 
Jo 

■ t \ ■ i \a JO, ifm^n, 
sm[mx) sm(nx)dx = < 
,1 I 7r if m = n, 

and 

' j7r JO, if m^n, 

cos{mx) cos(nx)dx = < 

7r, if m = n. 
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0.75 




0.2 r » 



Figure 2.6.1: Region beneath y = \/l — x 2 over the interval [0, 1] 



2.6.4 Change of variable revisited 

Suppose / is a continuous function on the interval [a, b] and ip is either an 
increasing function defined on an interval [c, d] with tp(c) = a and ip(d) = b, or 
a decreasing function defined on [c,d] with tp(c) = b and <p(d) = a. Then, by 
(2.6.5), changing the notation as necessary, 



6 ,-d 

f(x)dx = / f(ip(z))tp'(z)dz. 



(2.6.25) 



Earlier we used (2.6.25) to simplify integrals in the form of the right-hand side; in 
this section we will look at some examples which simplify in the other direction. 



Example 2.6.17. Since the graph of y = yl — x 2 for < x < 1 is one-quarter 
~ — 1 (see Figure 2.6.1), we know that 



of the circle x 2 + y 



V 1 — x 2 dx = —. 
o 4 

We will now see how to use a change of variable to evaluate this integral using 
the fundamental theorem. The idea is to make use of the trigonometric identity 
1 — sin (z) = cos 2 (z). That is, suppose we let x = sin(z) for < z < %. Then 

V 1 — x 2 = y 1 — sin 2 (z) = -\/cos 2 (z) = | cos(z)| = cos(z), 
where the final equality follows since cos(,2) > for < z < ?. Now 

dx = cos(z)dz 1 
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y 






2 




-~^H = \/l - 1 


/ 0.8- 








\ 0.6- 










\ 0.4- 


TV 
" 4 / 








\0.2- 








h- 


1 1 e^ 


^- 1— 




H 1 



"71 -°- 5 



0-5 7i 



Figure 2.6.2: Arc of x 2 



1 between 



(-75'75) and (7 



V2> V2 



so we have 



\/l — x 2 dx = / cos(z) cos(z)dz 

ii Jo 



cos {z)dz 
o 

1 /" f 

2 



(1 +cos(2z))dz 



o 



1 
2' 

7T 

4' 



1 



sin(2z) 



as we expected. 



Example 2.6.18. Let C be the circle with equation x 2 + y 2 = 1 and let L 
be the length of the shorter arc of C between I — j^^~m) an d ( ~ts , ~7s ) (see 
Figure 2.6.2). Since the circumference of C is 2tt and this arc is one-fourth of 
the circumference of C, we should have L = ^. We will now show that this 
agrees with (2.5.28), the formula we derived for computing arc length. Now 
y = \/l — x 2 , so 



dy 

d.r 



1 



{l- x 2 yi(-2x) 



vT 
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Hence 



2 



, dy\ / 1 a; 2 /l — x 2 + x 



dx J V 1 — a; 2 V 1 — x 2 \/\ — x 2 



Hence, by (2.5.28), 



If we let 



dx. 

i VI - a; 2 



a; = sin(z) 
da; = cos(z)dz, 



then 



: cos(z)dz 



zdz 



4 \J 1 — sin (z) 

1 cos(,j) 
f v/cos 2 ^) 
* cos(z) 
s cos(z) 

4 V / 

4T 



rfz 



IT 

2' 



Exercise 2.6.26. Use the change of variable x = 2sin(z) to evaluate 

1-2 



V 4 — a; 2 <fa;. 

-2 



r 1 2 

Exercise 2.6.27. Evaluate / - dx 

J - 2 Vl6 - x 2 

Example 2.6.19. In Example 2.5.8 we saw that the arc length L of the parabola 
y = x 2 over the interval [0, 1] is 

L = / y/l + 4x 2 dx. 
Jo 

However, at that point we did not have the means to evaluate this integral. We 
now have most, although not all, of the necessary tools. To begin, we will first 
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make the change of variable 

u = 2x 
du = 2dx, 

which gives us 

L = - / VI + u 2 du. 



2 Jo 
Next, we recall the trigonometric identity 

1 + tan 2 (£) = sec 2 (t) (2.6.26) 

(a consequence of dividing each term of the identity cos 2 (t) + sin (t) = 1 by 
cos 2 (£)), which is a hint that the change of variable 

x = taxi(z) 
dx = sec (z) 

might be of use. If we let a be the angle for which tan(a) = 2, with < a < ^, 
and note that tan(0) = and 



yl + tan 2 (z) = \/sec 2 (z) = |sec(z)| = sec(z) 
(note that sec(^) > since < z < ?), then 

1 f a 1 f a 

L= - sec(z)sec (z)dz = - / sec (z)dz. 

We may reduce the integral on the right using an integration by parts: Letting 

u = sec(z)dz dv = sec (z)dx 

du = sec(z) tan(z)dz v = tan(z), 

we have 

sec (z)dz = sec(z) tan(^)|p — / sec(z)tan (z)dz 



o 



sec(a) tan(a) — / sec(z)(sec (z) — l)dz 
Jo 

roc /-a 

2v 5 — / sec (z)dz + / sec(z)dz, 



where we have used the fact that tan(a) = 2 and 1 + tan 2 (a) = sec 2 (;z) to find 
that sec(a) = V5- It now follows that 

2 / sec 3 (z)dz = 2V5 + / sec(z)dz, 
Jo Jo 
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and so 

a s r 1 r 

sec (z)dz = v5 H — / sec(z)ck. 
2 Jo 

Hence 

t v/s i r 

L = — / sec (z )dz. 

For this reduced integral, we notice that 

. . , f a , , sec(z) + tan(z) , f a sec 2 (z) + sec(z) tan(z) , 

sec(z)dz= / sec(z) — — — -r^-dz = / -— — — rfz, 

o J sec(zj + tan(z) J sec(zj + tan(z) 

and so the change of variable 

w = sec(z) + tan(z) 
dw = (sec(2;) tan(z) + sec (z))dz 



gives us 



Thus we now have 



/•2+V5 i 

sec(z)dz = / —dw. 

o Ji w 

2 4 7i w 

Although greatly simplified from the integral with which we started, neverthe- 
less we cannot evaluate the remaining integral with our current tools. Indeed, 
we may use the fundamental theorem of calculus to evaluate, for any rational 
number n, any definite integral involving w n , except in the very case we are 
facing now, that is, when n = — 1. We will fill in this gap in the next section, 
and finish this example at that time (see Example 2.7.9). 

Example 2.6.20. For a simpler example of the change of variable used in the 
previous example, consider the integral 

the area under the curve 

1 
V 



l + x 2 
over the interval [—1, 1] (see Figure 2.6.3). If we let 

x = tan(z) 
dx = sec (z)dz, 
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1 -0.75 -0.5 -0.25 0.25 0.5 0.75 1 



Figure 2.6.3: Region beneath y = 1 * 2 over the interval [— 1, 1] 



and note that tan (—j) = —1 and tan (j) = 1, then 



i l + z 2 



rd,X 



1 



n 1 + tan 2 (z) 
1 sec 2 (z) 



sec 2 (z)dz 



dz 



sec 2 (z) 
dz 



IT 

~ 2 ' 

You should compare this with the simple approximation we saw in Example 
2.3.1. 



Exercise 2.6.28. Evaluate 



Exercise 2.6.29. Evaluate 



(3 



_ 3 9 + X 2 

1 



dx. 



„i 1 +Ax 2 



dx. 



Exercise 2.6.30. Show that for any positive integer n > 2, 



sec n (x)dx 



1 ' sec (cc)oa;. 



n- 1 



sec" (x) tan(x) + 



n — 1 
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2.7 The exponential and logarithm functions 

There are many applications in which it is necessary to find a function y of a 
variable t which has the property that 

for some constant real number k. Examples include modeling the growth of 
certain animal populations, where y is the size of the population at time t and 
k > depends on the rate at which the population is growing, and describing 
the decay of a radioactive substance, where y is the amount of a radioactive 
material present at time t and k < depends on the rate at which the element 
decays. We will first consider that case k = 1; that is, we will look for a function 
y = f(t) with the property that /'(£) = f(t). 

2.7.1 The exponential function 

Suppose / is a differential function on (— oo, oo) with the property that /'(£) = 
/(£) for all t (one may show that such a function does indeed exist, although we 
will not go into the details here). Now /'(£) = /(£) implies, by the fundamental 
theorem of calculus, that 

f f{x)dx = f f(x)dx = /(*) - /(0) (2.7.2) 

Jo Jo 

for all t. The value of /(0 is arbitrary; we will find it convenient to take /(0) = 1. 
That is, we are now looking for a function / which satisfies 

f(t) = l+ f f(x)dx. (2.7.3) 

Jo 

Suppose we divide [0, t) into N subintervals of equal length Air = -^, where N is 
a positive integer, and let xo, Xi, x%, . . . , xn be the endpoints of these intervals. 
Now for any i = 1,2, ... ,N, using (2.7.3), 

f( Xi ) = 1 + / * f(x)dx 

Jo 

/•Xi-i rXi 

= 1 + / f(x)dx + / f(x)dx 

JO JX!-! 

= f{xi-i) + f * f(x)dx. (2.7 A) 

J Xl-l 

Moreover, for small Ax, 

f(x)dx w f( Xi -i)Ax, (2.7.5) 

Xl_l 



(2.7.7) 


(2.7.8) 


(2.7.9) 


(2.7.10) 


(2.7.11) 


(2.7.12) 


(2.7.13) 
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and so we have 

f(xi) w /(Si_i) + f(xi-i)Ax = /(ii_i)(l + As). (2.7.6) 

Hence we have 

/(a ) = /(0) = 1, 
/(xiW(x )(l + Ax) = l + Ax, 
f(x 2 ) » /(xi)(l + Ax) « (1 + Ax) 2 , 
/(x 3 )«/(x 2 )(l + Ax)«(l + Ax) 3 , 
/(x 4 )«/(x 3 )(l + Ax)«(l + Ax) 4 , 

/(*jv) « /(**-i)(l + Ax) » (1 + As)" 
Now xn = t and Ax = ^, so we have 

Moreover, if we followed the same procedure with iV infinite and dx = 4, , we 
should expect (although we have not proved) 

/(t)~(l + ] . (2.7.15) 

We will let e = /(l). That is, 

e = sh(l+l) , (2.7.16) 

where N is any positive infinite integer. We call e Euler's number. Now if t is 
any real number, then 

f®=( 1+ Jf) N =l ( 1+ f) I =et ' ( 2 - 7 - 17 ) 

where we have used the fact that y is infinite since N is infinite and t is finite. 
(Note, however, that " is not an integer, as required in (2.7.16). The statement 
is nevertheless true, but this is a detail which we will not pursue here.) Thus 
the function 

/(*) = e* (2.7.18) 

has the property that f'(t) = f(t), that is, 

y-*. (2.7.19) 
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In fact, one may show that f(t) = e* is the only function for which /(0) = 1 
and f'(t) = f{t). We call this function the exponential function, and sometimes 
write exp(t) for e*. 

It has been shown that e is an irrational number. Although, like n, we 
may not express e exactly in decimal notation, we may use (2.7.16) to find 
approximations, replacing the infinite N with a large finite value for TV. For 
example, with N = 200, 000, we find that 

, n 200000 

H ) « 2.71828, (2.7.20) 

200000 J ' V ; 

which is correct to 5 decimal places. 

Example 2.7.1. If f(t) = e 5 *, then / is the composition of h(t) = 5t and 
g(u) = e u . Hence, using the chain rule, 

f'(t)=g'(h(t))ti(t) = e 5t -5 = 5e 5t . 

In general, if h(t) is differentiable, then, by the chain rule, 



d 



-e 



h(t) _ h /( f \Mt) 



h'(t)e hW . (2.7.21) 



dl 
Example 2.7.2. If f{x) = 6e~ x2 , then 

f{x) = -12xe- x2 . 

Exercise 2.7.1. Find the derivative of g(x) = \2e~ 7x . 
Exercise 2.7.2. Find the derivative of /(£) = 3t 2 e~*. 

Example 2.7.3. Let f(t) = e* and g(t) = e~ l . Since e* > for all t, we 
have f'(t) = e > and fit) = e > for all t, and so / is increasing on 
(— oo,oo) and the graph of / is concave upward on (— oo,oo). On the other 
hand, g'{i) = — e - * < and g"(t) = e~ l > for all t, so g is decreasing on 
(— oo,cx>) and the graph of g is concave upward on (—00,00). See Figure 2.7.1. 

Of course, it follows from (2.7.19) that 

e t dt = e t + c. (2.7.22) 

Example 2.7.4. From what we have seen with the examples of derivatives 
above, we have 

e^dt = -e"^ = -e _1 + e° = 1 - e^ 1 w 0.6321. 
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Figure 2.7.1: Graphs of y = e and y = e 



Example 2.7.5. To evaluate 



dx, 



we will use the change of variable 



Then 



u = —x 
du = —2xdx. 



xe dx = — — / e u du = —-e u + c 
z j z 



-e + c. 



Example 2.7.6. To evaluate 



xe 2x dx 



we will use integration by parts: 
u = x 
du = dx 



dv = e 2x dx 
1 



v 



2x 



Then 



/ 



xe 2x dx = --xe 2x + - e 2x dx = --xe 2x - -e 2x + c. 
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Exercise 2.7.3. Evaluate / 5e 2x dx. 

o 

Exercise 2.7.4. Evaluate / x 2 e~ x dx. 



Exercise 2.7.5. Evaluate / x 2 e x dx. 



Note that if y = ae kt , where a and k are any real constants, then 

kae kt = ky. (2.7.23) 



dy , a; 



dt 

That is, y satisfies the differential equation (2.7.1) with which we began this 
section. We will consider example applications of this equation after a discussion 
of the logarithm function, the inverse of the exponential function. 

2.7.2 The logarithm function 

The logarithm function is the inverse of the exponential function. That is, for 
a positive real number x, y = log(a;), read y is the logarithm of x, if and only if 
e y = x. In particular, note that for any positive real number x, 

e log(x) = x, (2.7.24) 

and for any real number x, 

\og(e x ) = x. (2.7.25) 

Also, since e° = 1, it follows that log(l) = 0. 

Since log(a;) is the power to which one must raise e in order to obtain x, 
logarithms inherit their basic properties from the properties of exponents. For 
example, for any positive real numbers x and y, 

log(xy) = log(cc) + log(y) (2.7.26) 

since 

e \o g (x)+\og{y) = e log(z) e logfo) = xy ^ (2.7.27) 

Similarly, for any positive real number x and any real number a, 

log(a; a ) = alog(x) (2.7.28) 



since 



3 alog(x) = Llog(x)\ a = x a_ (2.7.29) 
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Exercise 2.7.6. Verify that for any positive real numbers x and y, 



logl-l=log(aO-log(y). (2.7.30) 

Note in particular that this implies that 

log(-j=-log(y). (2.7.31) 

To find the derivative of the logarithm function, we first note that if y = 
log(x), then e y = x, and so 

d ., d 







dx 


dx 


Applying the chain rule, 


it follows that 








e ydy_ _ 

dx 


-- 1. 


Hence 




dy_l_ 


1 






dx e y 


X 


Theorem 2.7.1. 


For any real number x 


>o, 






— log(x) 

dx 


1 

X 



(2.7.32) 



(2.7.33) 



(2.7.34) 



(2.7.35) 



Example 2.7.7. Since, for all x > 



— log(x) = - > 

dx x 

and 

d 2 , , , 1 
— -^log(x) = j < °' 

the function i/ = log(x) is increasing on (0, tx>) and its graph is concave down- 
ward on (0,oo). See Figure 2.7.2. 



Example 2.7.8. If f(x) = log(x 2 + 1), then, using the chain rule 

/'(*) = -^t-tV + D "' 



x 2 + 1 dx x 2 + 1 

Exercise 2.7.7. Find the derivative of f(x) = log(3x + 4). 
Exercise 2.7.8. Find the derivative of y = (x + 1) log(x + 1). 
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Figure 2.7.2: Graph of y = log(a;) 



Using the fundamental theorem of calculus, it now follows that, for any 
x > 0, 

^t=log(t)|^ = log(x)-log(l)=log(^). (2.7.36) 

This provides a geometric interpretation of log(a;) as the area under the graph 
of y = 4 from 1 to x. For example, log(10) is the area under the graph of y = j 
from 1 to 10 (see Figure 2.7.3). 

Example 2.7.9. We may now complete the example, discussed in Example 
2.5.8 and continued in Example 2.6.19, of finding the length L of the graph of 
y = x 2 over the interval [0, 1]. In those examples we found that 



Jo 



1 + Ax 2 dx 



V5 

2 



2+v^ 



-dw. 



w 



Now we see that 



r?+V5 



\ -dw = log(w)\ 2+V5 = log(2 + A), 

Ji w 



and so _ 

L=^+ilog(2 + A). 

Rounding to four decimal places, this gives us L « 1.4789, the same approxi- 
mation we obtained in Example 2.5.8. Note, however, the advantage of having 
an exact expression for the answer: We may use the exact expression to easily 
approximate L to however many digits we desire, whereas we are unsure of the 
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Figure 2.7.3: Area under the graph of y 



precision of our original approximate result, and would need to recalculate the 
approximating sum whenever we wanted to try to improve upon our accuracy. 
Moreover, our procedure for finding the exact expression for L may be extended 
easily to find an expression for the length of any segment of the parabola y = x 1 . 



Example 2.7.10. To evaluate 



dx, 



/o l + x 2 
we first make the change of variable 

u = 1 + x 2 



Then 



l + x 1 



rdx 



du = 2xdx. 



-du 



log(u) 



log(2) 



Example 2.7.11. Evaluating 



log(x)dx 



provides an interesting application of integration by parts. If we let 

dv = dx 
v = x, 



u = log(x) 
1 



du = —dx 

x 
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then 

10 /.10 

log(x)dx = xlog(x)\ 1 — / dx = 101og(10) — 9. 
l J\ 

Example 2.7.12. We could use the change of variable u = 3x + 2 to evaluate 



5 4 



-dx, 



! 3a; + 2 ' 
or just make the appropriate correction for the chain rule: 



5 



4 , 4 



(ice = - log(3x + 2) 
„ 3a; + 2 3 M 



■"> 



II 



4 4/17 

-(log(17)-log(2)) = -logf y 



2 1 

Exercise 2.7.9. Evaluate / dx. 

lo x + 1 



: 2 ^ 

Exercise 2.7.10. Evaluate / — dx. 



! 3a; 2 + 4 

r 2 

Exercise 2.7.11. Evaluate / a;log(a;)da;. 

Exercise 2.7.12. Evaluate / V 1 + x 2 dx. 

f 1 1 
Exercise 2.7.13. Evaluate / dx 

J-l y/1 + x 2 

It is now possible to extend the power rule for differentiating x n . Suppose 
n 7^ is a real number and note that, for x > 0, 

x n = e log(x") = e «log(x)^ (2.7.37) 

Then 

—x n = c nl °z( x ) 
dx dx 

= e nlos{x) —nlog(x) 
dx 

_ ^ e nlog(a:) 

a; 

n n 

= -x 

X 

= nx n - 1 . (2.7.38) 

Thus we have our final form of the power rule. 
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Theorem 2.7.2. For any real number n ^ 0, if x > 0, 

^-x n = nx n -\ (2.7.39) 

ax 

Example 2.7.13. If f{x) = X* , then f'(x) = itx*' 1 

Exercise 2.7.14. Find 

— ir x 
dx 

by first writing n x = e xl ° 6 ^'. How does this result compare with the result of 
the previous example? 

2.7.3 Some applications 

As mentioned at the beginning of this section, there are many applications in 
which one desires to find a function y which, for some constant k, satisfies the 
differential equation 

§ = ky. (2.7.40) 

at 

Such an equation arises whenever the desired quantity grows, or decreases, at a 

rate which is proportional to its current value. As we saw above, a function of 

the form 

y = ae kt (2.7 .41) 

satisfies this equation for any real constant a. Moreover, it may be shown that 
any solution must be of this form. 

For example, (2.7.40) is used to model radioactive decay. That is, if one 
begins with y grams of a radioactive element and y is the amount of the element 
which remains after t years, then there is some constant k (which depends on 
the particular element being considered) for which 

tt = ^ (2 - 7 - 42) 

It follows that, for some real number a, 

y = ae kt . (2.7.43) 

Since we are given that y = y when t = 0, it follows that 

2/o = j/(0) = ae° = a. (2.7.44) 

Hence 

V = Voe kt . (2.7.45) 

Now suppose t\ < t<i are such that y at time £2 is one-half of y at time t\. 
Then 

ij/oe fetl =j/oe fet2 . (2.7.46) 
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Thus 

e ktl =2e kt2 , (2.7.47) 

and so 

p kti 

2 = — = e k{tl ~ t2) . (2.7.48) 

Hence 

log(2) = log (> (tl - t2) ) = k(h - t 2 ), (2.7.49) 

from which it follows that 

t 2 -t 1 = -^fl. (2.7.50) 

k 

Note that the right-hand side of (2.7.50) does not depend on t\, and so the time 
required for one-half of a radioactive element to decay does not depend on the 
initial amount of the element. We call this time the half -life of the element. 

Typically the rate of decay of a radioactive element is expressed in terms of 
its half-life. From (2.7.50), we see that if the half-life of a particular element is 
T, then the decay rate of the element is 

k = ~^f. (2.7.51) 

Example 2.7.14. Carbon-14 is a naturally occurring radioactive isotope of 
carbon with a half-life of 5730 years. A living organism will maintain constant 
levels of carbon-14, which will begin to decay once the organism dies and is 
buried. Because of this, the amount of carbon-14 in the remains of an organism 
may be used to estimate its age. For example, suppose a piece of wood found 
buried at an archaeological site has 14% of its original carbon-14. If T is the 
number of years since the wood was buried and j/o is the original amount of 
carbon-14 in the wood, then 

0.14y = y e kT , 

where, from (2.7.51), 

log(2) 



k 



5730 
It follows that 0.14 = e kT , and so 



Thus 



fcT = log(0.14). 



log(0.14) 57301og(0.14) ^ 1C „ 

T = = — — w 16, 253 years. 

k log(2) 



Exercise 2.7.15. Suppose a piece of wood buried at an archaeological site has 
23% of its original carbon-14. For how many years has the wood been buried? 
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Exercise 2.7.16. Suppose a radioactive element has a half-life of 24, 065 years. 
How many years will it take for a given sample to decay to the point that only 
10% of the original amount remains? 

The differential equation (2.7.40) also serves in some situations as a simple 
model for population growth. Suppose, for example, that y is the size of a 
population of a certain species of animal over a specified habitat. In the absence 
of any extraneous limits on the size of the population (such as a limitation on 
the food supply) , we would expect the rate of growth of the population to be 
proportional to the current population; that is, we would expect y to satisfy 
(2.7.40) for some constant k. In particular, if yo is the size of the population at 
some initial time and y is the size of the population t year later, then 

y = y e kt . (2.7.52) 

Now suppose we know the population is y\ at some time t\ > 0. Then 

Vi=yoe kt \ (2.7.53) 

and so, after dividing through by yo and taking the logarithm of both sides, 

fc=Ilog(^). (2.7.54) 

h \yoJ 

Example 2.7.15. Suppose that a certain habitat initially holds a population 
of 1000 deer, and that five years later the population has grown to 1200 deer. If 
we let y be the size of the population after t years, and assuming no constraints 
on the growth of the population, we would have 











y 


= 1000e fe4 , 




3 






k 


= ilog( 


'1200\ log(1.2) 

v ioooy ~ 5 




, for 


example, 


this 


model would predict a population 


of 










2/(10) = 


= 1000e 10fc 
= 1000e 21og ( 1 - 2 ' 
= lOOOeM 1 ' 22 ) 
= (1000)(1.2) 2 





= 1440 deer 
after five more years. 

If y is the size of a population of animals, we call the differential equation 

f = ky, (2.7.55) 
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and the resulting solution, 

y = y e kt , (2.7.56) 

the natural growth model. Although often relatively accurate over small time 
intervals, this model is clearly unrealistic for any extended period of time as it 
predicts an unbounded growth. Even in the best of situations, other factors, 
such as availability of food and shelter, will eventually come into play. 

The logistic model introduces a variation in the basic model (2.7.40) which 
factors in the limiting effect of space and food. In this case, we suppose that 
there is an upper bound, say M, to the size of the population which the habitat 
can sustain and that the rate of growth of the population decreases as the size 
of the population approaches this limiting value. More precisely, let k be the 
natural rate of growth of the population, that is, the rate of growth if there are 
no constraining factors, or when the size of the population is small compared 
with M, and let y be the size of the population at time t. Then we suppose k is 
decreased by a factor of 1 — jj, that is, the proportion of room left for growth. 
The resulting differential equation for the logistic model is 

ft = ky i 1 ~ m) = Ti y{M ~ v) = Pv{M ~ yl (2J ' 57) 

where (3= £. 

To solve (2.7.57), we first rewrite it using s for the independent variable, 
that is, as 

^ = 0y(M-y), (2.7.58) 

and then divide through both sides by y(M — y) to obtain 

-TTT T?=/ ? - ( 2 - 7 - 59 ) 

y(M-y)ds 

Next, we integrate both sides of this equation from to some fixed time t: 

I -TVT ^ d T ds = I P ds - (2- 7 - 60 ) 

J y(M - y) ds J M 

For the right hand side of (2.7.60), we have simply 

t 
fids = /3t. (2.7.61) 

'o 

For the left-hand side, we begin with the change of variable 

u = y 

du = -^-ds, (2.7.62) 

ds 

from which we obtain 

ft 1 



V -ds= [ — — -du, (2.7.63) 



o y( M ~ y) ds J u(M - u) 



2. 7. THE EXPONENTIAL AND LOGARITHM FUNCTIONS 



123 



where we have again used j/o to denote the size of the population at time s = 
(and noting that y is the size of the population when s = t). We assume that 
< yo < M and < y < M; that is, we assume that the populations involved 
are positive and do not exceed the maximum sustainable population. 

To evaluate (2.7.63), we rely upon a result involving what are known as 
partial fraction decompositions: There exist real numbers A and B such that 



B 



u(M-u) u M-u' 
To find A and B, we note that (2.7.64) implies that 

1 _ A(M -u) Bu A(M -u)+Bu 

u(M - u) ~ u(M - u) u(M - u) ~ u(M-u) 

It follows that, for all values of u, 

1 = A{M - u) + Bu. 



(2.7.64) 



(2.7.65) 



(2.7.66) 



In particular, when u = we have 1 = AM, and when u = M we have 1 = BM 
Hence 

A = — and B = — , 
M M' 



and so 

Hence we now have 

rv 1 

u(M - u) 



1 1 



u(M -u) M u M M -u 



(2.7.67) 
(2.7.68) 



du 



-du 



M J yo M-u 



-du 



M 
1 

M 

1 



log(u) 



M 



log(M - u) 



(log(y) - log(yo) - log(M - y) 
+ log(M-j/ )) 



( y(M-yo) 
M ° S \y Q (M-y) 



Combining (2.7.60), (2.7.61), and (2.7.69), we have 

'y(M-y ) 



(2.7.69) 



(3Mt = log 



.yo(M-y) 
which now need to solve for y. To begin, exponentiate both sides to obtain 



(2.7.70) 



o/ 3Mt = v( M - Vo) 

Vo(M - y) • 



(2.7.71) 
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It follows that 

y(M - so) = e 0Mt y o (M - y) = y o Me 0Mt - y ye^ Mt , (2.7.72) 

and so 

y Q Me (iMt = y(M - y ) + y o ye 0Mt = (M + y {e^ Mt - l))y. (2.7.73) 

Thus 

y - y^ mt (2774) 

If we divide the numerator and denominator of the right-hand side by e@ Mt , we 
have 

v= VoM. (2 7 75) 

y Me -pMt + yo _ yoe -pMt> y ■'■'°> 

from which we obtain our final form 

y° M in t w\ 

y= yo + (M-y )e-^- (2 - 7J6) 

Note that when t = 0, (2.7.76) reduces to y = yo, as it should given our initial 
condition, and when t is infinite, e~^ Mt ~ 0, and so y = M. Hence yo < y < M 
for all t, with y approaching M as t grows. 

Recalling that j3 = -^, where k is the natural rate of growth of the popula- 
tion, we may rewrite (2.7.76) as 

y° M in T T7\ 

V = — — JT7 r^a • 2.7.77 

Example 2.7.16. In our previous example, where y represented the number of 
deer in a certain habitat after t years, we found 

k _ log(1.2) 



Now suppose the habitat can support no more than 20,000 deer. Then the 
logistic model would give us 

(1000) (20000) 20000 



1000+ (20000- 1000)e fc * 1 + 19e 



z (10) = - | — _ wk fa 1409 deer, 



for the number of deer after t years. After ten years, this model would predict 
a population of 

20000 

1 + 19e- 

only slightly less than the 1440 predicted by the natural growth model. However, 
the differences between the two models become more pronounced with time. For 
example, after forty years, the natural growth model predicts 

y(40) = 1000e 40fc w 4300 deer, 
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20000 



15000 — 



10000 



r. ) 




Figure 2.7.4: Comparison of natural and logistic growth models 



while the logistic model predicts 



z(40) 



20000 



3691 deer 



1 + 19e- 40fc 
and in 100 years, the natural growth model predicts 



lOOfe 



2/(100) = lOOOe 
while the logistic model predicts only 

, , 20000 

0(100) 



1 + 19e 



-lOOfe 



38338deer, 



13373 deer. 



Of course, over time the natural growth model predicts a population which grows 
without any bound, whereas the logistic model predicts that the population, 
while always increasing, will never surpass 20, 000. See Figure 2.7.4. 



Exercise 2.7.17. Suppose a certain population of otters grows from 500 to 
600 in 4 years. Using a natural growth model, predict how many years it will 
take for the population to double. 

Exercise 2.7.18. Suppose habitat of the otters in the previous exercise can 
support no more than 1500 otters. Using a logistic model, and the natural rate 
of growth found in the previous exercise, predict how many years it will take 
for the population to double. 

In evaluating (2.7.63) we made use of a partial fraction decomposition. More 
generally, suppose p and q are polynomials, the degree of p is less than the degree 
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of q, and q factors completely into distinct linear factors, say, 

q(x) = (a\X + bi)(a,2X + b 2 ) ■ ■ ■ (a n x + b n ). (2.7.78) 

Then it may be shown that there exist consants A\, A%, . . . , A n for which 

q(x) a\X + o 2 a-ix + o 2 a n x + b n 

The evaluation of 

[ bp M dx , 

for any real numbers a and b for which q{x) ^ for all x in [a, 6], then follows 
easily. 

Example 2.7.17. To evaluate 

1 x 

„ x^A dX ' 

we first note that, since x 2 — 4 = (x — 2)(x + 2), there exist constants A and B 

for which 

x A B 



x 2 -4 x-2 x + 2 

It follows that 

x _ ^(x + 2) + S(x-2) 

x 2 - 4 x 2 - 4 

and so 

x = A(x + 2) + B(x- 2) 

for all values of x. In particular, when x = 2 we have 2 = AA, and when x = —2 
we have —2 = — AB. Hence A = ^ and B = ^, and so 

x 11 11 



x 2 -4 2x-2 2x + 2 
And so we have 

1 x . i r 1 i . i r 1 i 



dx = — I cfx H — / dx. 

^-^ — 4 2 Jq x — 2 2 Jo x + 2 



Now 



i r 1 i . i 



dx = - log(x + 2) 



= i(log(3)-log(2)), 

o z 



2 J x + 2 2 

but the first integral requires a bit more care because x — 2 < for < x < 1 . 
If we make the change of variables, 

u = -(x-2) 
du = —dx, 
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then, since x — 2 = —u, 



i r i 



-dx 



i r i 



Zi J n tC ^ .Z «/ 2 



-rfw 



1 [ 2 l 



-du 



■ log(w) 



Hence 



■log(2). 



^TT<fc = i(log(3) - log(2)) - I log(2) = l - log(3) - log(2). 



Exercise 2.7.19. Evaluate 



Exercise 2.7.20. Evaluate 



rdx. 



x + 4 



x 2 + 3x + 2 



-dx. 
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Answers to Exercises 



1.2.1. (a) 32 feet per second, (b) 64 feet per second, (c) 40 feet per second 

1.2.2. W[i.i+At] = 32 + 16At feet per second 

1.2.3. «[i.i+dt] = 32 + I6dt feet per second, v(l) = 32 feet per second 

1.2.4. v(2) = 64 feet per second 

1.2.5. v(t) = 32i feet per second 



1.3.1. Let a = r\ -\- e\ and b = r-i + £2, where r\ and r-i are real numbers 
and ei and €2 are infinitesimals. Note that a + b = (ji + r 2 ) + (ei + £2) & n d 
ab = rxr 2 + (rie 2 + ^2^1 + £i£2)- 



1.3.2. Let a = r + e, where r 7^ is a real number and e is an infinitesimal. 

Note that 

1 1 e 



a r r(r + e) 



1.5.2. (-oo,-l) and (-l,oo) 

1.5.5. (— oo,0) and (0,oo) 

1.5.6. (-oo,-l), (-1,1) and (l,oo) 
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dv 
1.6.1. -f- = 5 

ax 



1.6.2. — = 3a; 2 
dx 



1.6.3. f'(x) = 8x 



1.7.1. ^ = 2x 

ax 



1.7.2. /'(a:) 



2^ 



2x 



1.7.3. — = 16a; 
ax 



1.7.4. /'(*) 



1.7.5. — = 65a; 4 
ax 



1.7.6. /'(a;) = 20a; 3 - 6a; 



1.7.7. — = 21x 6 -3 
ax 



1.7.8. /'(a;) = 15a; 4 - 24a; 3 - 10a; 



1.7.9. 



dy 42 - 168a; 2 



dx (4a; 3 - 3a;) 2 



l^o. rw ^y 
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1.7.11. 



dy 

dx 



216 



x=l 



1.7.12. f'{x) 



V4x + 6 



1.7.13. -^ = 2Qx{x 2 + 5) 9 
ax 



1.7.14. 



cL4 

"aT 



4007T cm 2 /sec 



r=100 



1.7.15. 



dr 

~dl. 



r=15 



9?r 



centimeters per second 



1.7.16. f'(x) 



x ■> 



1.7.17. 



dy Ax 



dx (a;2 + 4)- 



1.7.18. 



f'(x) = \2(x 2 + 3x - 5) 10 (3a; 4 - 6x + 4) 11 (12:r 3 - 6)+ 

10(a; 2 + 3x- 5) 9 (3x 4 - 6x + 4) 12 (2x + 3) 



1.7.19. -j- = -3sin(3i + 6) 

-^ = -8sin 2 (t)cos(4t)sin(4i) + 2sin(t)cos(i)cos 2 (4t). 



1.7.21. -^ = 6sec 2 (3i) tan(3i) 
dx 



1.7.22. f'(t) = 6tan(3t)sec 2 (3i) 
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1.8.1. y = 90(a; - 2) + 39 



7T\ 3 

1.8.2. t , = ;l(t--)+- 



1.9.1. / is increasing on (—00, — v2] and on [v2, 00], decreasing on [— v2, v2]; 
/ has a local maximum of 4V2 at x = — v2 and a local minimum of — 4y2 at 
x = V%. 



1.9.2. / is increasing on [— 1, 0] and on [0, 1] (or, simply, [— 1, 1]; /is decreasing 
on (—00, —1] and on [1, 00); / has a local maximum of 2 at x = 1 and a local 
minimum of —2 at x = — 1. 



1.9.3. /is increasing on all intervals of the form [— nir, nir], where n = 1, 2, . . . 
/ has no local maximums or minimums (indeed, / is increasing on (—00, 00)). 



1.10.1. Maximum value of 20 at x = 4; minimum value of 12 at x = 2 



1.10.2. Maximum value of 3.4840 at t = ^; minimum value of -0.3424 at 
t = - 

1 6 



1.10.3. R is V2a by ^2b 



1.10.4. (a) all the wire is used for the circle; (b) 43.99 cm use for the square, 
56.01 cm used for the circle 



1.10.6. (1,1) 



1.10.9. (a) Hp,!) and [~,f] (b) (0,-4) 



dy x — Ay 
1.11.1. — = - 

dx Ax + y 
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1.11.2. 



dy_ 
dx 



(x )3 /) = (2,-l) 



1.11.3. y = --(x-3) + 4. 



1.11.4. y = --(x-2) + 3 



1.11.5. - cm/sec 

2 ' 



1.11.6. 234.26 miles/hour 



1.11.7. 64 cm 2 /sec 



rfx 



dx 2 



1.1 2.1 . — = 2cos(2x), ^-| = -4sin(2x), ^ = -8cos(2x) 



dx 3 



1.12.2. /'(x) 



V4x+1 



/"(*) 



4 rw 6 



(4x + 1)5 



(4x + 1)J 



1.12.3. 69.79 cm/sec 



1.12.4. Concave upward on (—00, —1) and (0, 1); concave downward on ( — 1, 0) 
and (l,Oo); Points of inflection: (-1,-2), (0,0), (1,2) 



1.12.5. Local maximum of —2 at x = — 1; local minimum of 2 at x = 1 



1.12.6. Local maximum of 2 at t = — 1; local 



minimum 



of -2 at t = 1 



2.1.1. (a) {x 2 + 3)dx = -x 3 + 3x + c 

f ! ! 

(b) / — dx = he 

J x l x 
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(c) / (3 sin(x) — 5 sec(x) tan(x))dx = —3 cos(x) — 5 sec(x) + c 

(d) / A\fx dx = 6x2 -|- c 



2.1.2. F(x) = x 5 - 2x 2 - 12 

2.1.3. x(t) = -10cos(i) + 20 

2.1.4. 64.20 meters 

2.2.1. (a) 30.11 centimeters, (b) 29.95 centimeters 



r 1 i 

2.4.1. / x 4 dx = - 
Jo 5 



2.4.2. / sm(x)dx = 2 
Jo 



2.4.3. 20 centimeters 



2.5.1. 


1 
6 


2.5.2. 


8 
3 


2.5.3. 


1 
6 


2.5.4. 





2.5.5. 


3 
2 



2.5.9. 



8tt 
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2.5.10. 



2tt 



2.5.11. 



2.5.12. 



16-4v/2 



2.5.13. 3.8153 



2.6.1. / 3x 2 \/l + x 3 dx = -(l + a; 3 )5 + c 

2.6.2. / x^4 + 3a; 2 dx = -(4 + 3:r 2 )2 + c 

2.6.3. / sec 2 (4x) tan 2 {Ax)dx = — tan 3 (4:r) + c 

r 2 x 

2.6.4. / d.T. = 9 a/9 - 9! 

7 \/4 + a? 



2.6.5. / sin(3x)d2: 



2.6.6. / sirL{2x)cos{2x)dx= — 



2.6.7. / xs'm(2x)dx = a;cos(2a;)H — sin(2:r) + c 
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/l 2 2 

X cos(3x)dx = -x sin(3a:) H — xcos(3a;) sin(3ai) + c 
o y a i 



2.6.9. / a; cos i-x ) r/.r = 2;, - 4 



2.6.10. / 2 3x 2 cos{x 2 )dx = -6tt 
Jo 

Z" 2 , , 264^3- 16 

2.6.11. / x 2 y/l + x dx= 

Jo 105 

/■IT 

2.6.12. / sin 2 (2x)dx = - 
Jo 2 



2 



2.6.13. / cos 2 (3a;)da; 
Jo 

/3 1 1 
cos (x)dx = —x H — sin(2a;) H sin(4:r) + c 
i; 84 v; 32 y ' 



a, n 357T 

2.6.15. / sin 8 Or da; = 

/o 128 



2.6.16. / sm 7 {x)dx 
Jo 



2.6.18. / cos 6 (x)rfa; 
Jo 
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16 



2.6.20. / * cos 5 (2a;)(ia; 
Jo 



4 
15 



n i 

2.6.21. / sin(2x) sm(x)dx = 



2.6.22. / sin (a;) sm(2x)dx = - 
Jo 3 



f* 1 

2.6.23. / sin(3x)cos(3a;)rfa;= - 

Jo 6 



2.6.24. / cos(a;) cos(2a;)<ia; = - 
Jo 3 



/•2 

2.6.26. / \/4 - x 2 dx = 2n 

J-2 



2 7T 

2.6.27. / dx = - 

- 2 V16-X 2 6 



2.6.28. 



(i 



3 9 + a; 2 



-dx = IT 



,' 2 1 7T 

2.6.29. / -dx = - 

_i l + 4x 2 4 



2.7.1. gr'(a;) = -84e 



-7a 



2.7.2. /'(£) = (6t-3t 2 )e~ 



2.7.3. / 5e- 2a; da; 



o 



5 5 

( 

2 2 



e" 8 w 2.4992 



2.7.4. / x z e~ x dx= -(1 - e) ss 0.2107 
Jo 3 



2.7.5. / cc 2 e- x dx = -2e _a: - 2x6"* - a; 2 e _a: + c 
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2.7.7. f'(x) '' 



3x + 4 



2.7.8. 4- = 1 + logO + 1) 

ax 



f 2 1 
2.7.9. / — — cte = log(3) 



2 a; , 1, /16 



2.7.10. / — dx=-loL. 

_ 1 3x 2 + 4 6 b \7 



2 Q 

2.7.11. / cclog(x)d:c = 21og(2)- - 



2.7.12. / y/l + x 2 dx = V2 + log(l + V2) 



i | 

2.7.13. / rfx = 2 log(l + V2 

_i VI + x 2 



2.7.14. — tt* = (log(7r))7r a 
ax 



2.7.15. 12, 149 years 

2.7.16. 79,942 years 

2.7.17. 15 years 

2.7.18. 31 years 



2.7.19. £^,= 3^(5) 
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2.7.20. 



14 



c 2 + 3a; + 2 



dx = 5 log(2) - 2 log(3) 



Index 



Arc length, 89, 88-91 
Area between curves, 79-84 

Chain rule, 33 
Continuous function, 8-23 

at a point, 9 

from the left, 10 

from the right, 10 

on a closed interval, 11 

on an open interval, 9 

polynomial, 14 

rational function, 15 

Decreasing function, 43, 43-45 
Derivative, 24 

higher-order, 57-58 

second-order, 57 
Differentiable, 26 

Euler's number, 111 

Exponential function, 112, 110-114 

Extreme- value property, 22 

Finite number, 8 
Fundamental theorem 

of calculus, 63, 72, 78 

of integrals, 77 

Half-life, 120 

Hyperreal numbers, 8, 7-8 

shadow, 8 

standard part, 8 

Implicit differentiation, 55, 52-57 
Increasing function, 43, 43-45 
Infinite number, 8 



Infinitesimal, 2, 4, 7 
Intermediate-value property, 23 

Local maximum, 43 

second-derivative test for, 61 
Local minimum, 43 

second-derivative test for, 61 
Logarithm function, 114, 114-119 
Logistic model, 122 

Mean- value theorem, 42 

Natural growth model, 122 

Optimization 

on a closed interval, 46-48 
on other intervals, 49-52 

Point of inflection, 60 
Power rule, 29, 31, 36, 118 
Product rule, 28 

Quotient rule, 32 

Rate of change, 6, 23 

average, 6 

instantaneous, 6 
Rolle's theorem, 41 

Second-derivative test, 61 
Singular point, 46 
Slope, 39 
Stationary point, 46 

Tangent line, 39 

Techniques of integration, 91-109 

change of variable, 91-95, 104-109 
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INDEX 141 

partial fractions, 123, 125-127 
parts, 95, 95-97 
reduction formulas, 98 
Trigonometric functions, 15-20 
derivatives of, 37-39 

Velocity, 4, 5 

average, 3 

instantaneous, 4 
Volumes, 84-87 



